1. Finlayson, S. G., Subbaswamy, A., Singh, K., et al. (2021). The clinician and dataset shift in artificial intelligence. Nature Medicine, 27, 1687–1692.
Reference Point: The abstract and introduction (p. 1687) state, "A major challenge for the safe and equitable real-world translation of clinical artificial intelligence (AI) is the problem of dataset shift... When a model is deployed on data drawn from a different distribution than its training data, its performance can degrade substantially." This directly supports data drift as the cause for performance degradation over time in a clinical AI setting.
2. Subbaswamy, A., & Saria, S. (2020). From development to deployment: dataset shift, causality, and shift-stable models in health AI. Biostatistics, 21(2), 345–352. https://doi.org/10.1093/biostatistics/kxz050
Reference Point: Section 1, "Introduction" (p. 345), explains that models trained on a specific data distribution often fail when deployed because the deployment (target) distribution differs. The paper notes, "This problem, referred to as dataset shift, is a key reason why models that perform well in development fail to translate to practice." This confirms that drift is a primary cause of performance failure after deployment.
3. MIT OpenCourseWare. (2021). 6.S191: Introduction to Deep Learning - Lecture 10: Limitations & New Frontiers.
Reference Point: In discussions on the deployment and limitations of deep learning models, the topic of distributional shift (data drift) is highlighted as a critical real-world challenge. The lecture materials emphasize that models are not static and that their performance (including accuracy and other metrics) will decay if the real-world data they encounter diverges from their training data, necessitating continuous monitoring and retraining.