1. Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014). A survey on concept drift adaptation. ACM Computing Surveys (CSUR), 46(4), 1-37. Section 2, "Concept Drift," defines the problem as changes in the underlying data distribution over time, which is a primary cause of model performance degradation. (https://doi.org/10.1145/2527846)
2. Stanford University. (2021). CS 329S: Machine Learning Systems Design, Lecture 13: Data and Concept Drift. This courseware explains that monitoring for and detecting distribution drift between training and inference data is a crucial step in maintaining model performance.
3. Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., & Zhang, G. (2018). Learning under concept drift: A review. IEEE Transactions on Knowledge and Data Engineering, 31(12), 2346-2363. This paper emphasizes that "the major cause of performance deterioration of a learned model is that the distribution of the data changes over time." (https://doi.org/10.1109/TKDE.2018.2876857)