1. University Courseware:
Ng, A. (2018). CS229 Machine Learning Course Notes. Stanford University. In the section on "Evaluation Metrics," it is explicitly stated that for tasks with skewed classes, "classification accuracy is not a good metric to use." The notes then introduce precision and recall as necessary alternatives to understand model performance more deeply. (See Part V, "Learning Theory," section on Evaluation Metrics).
2. Peer-reviewed Academic Publications:
Hossin, M., & Sulaiman, M. N. (2015). A Review on Evaluation Metrics for Data Classification Evaluations. International Journal of Data Mining & Knowledge Management Process, 5(2), 1-11. DOI: 10.5121/ijdkp.2015.5201. This paper reviews various metrics and highlights that "accuracy is not a proper measure when the data set is imbalanced" (p. 3), advocating for the use of a confusion matrix, precision, recall, and other metrics for a comprehensive evaluation.
3. Official PMI Publications (Principles):
Project Management Institute. (2022). AI in Project Management. PMI White Paper. While not a technical manual, this paper emphasizes that the goal of AI in projects is to "deliver value" and achieve "desired outcomes" (p. 4). Relying on a single technical metric like error rate without considering its impact on business value and outcomes contradicts this core principle of value-driven project management.