Question 4 - PMI-CPMAI Real Exam Questions [July 2026 Update]

Q: 4

A financial services firm is assessing the success of a newly operationalized AI system for fraud detection. The project manager needs to evaluate the model against business key performance indicators (KPIs). What is an effective method to help ensure the accuracy of this evaluation?

Options

Correct Answer: B

Explanation

Evaluating an AI system for a critical function such as fraud detection requires more than one view of its performance, because a single metric can be highly misleading. Combining several validation techniques, such as A/B testing against a control group, backtesting on historical data, and tracking a suite of performance metrics (e.g., precision, recall, F1-score, false positive rate), gives a more complete and reliable picture. This approach lets the project manager weigh the trade-offs (e.g., catching more fraud vs. inconveniencing legitimate customers) and map the model's technical performance to specific business KPIs such as fraud loss reduction and customer satisfaction.
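As a rough illustration of the metric suite named above, the following Python sketch computes precision, recall, F1-score, and false positive rate from confusion-matrix counts and then restates the same counts in business terms. The counts, the average fraud amount, and the names ConfusionCounts, metric_suite, and business_view are all hypothetical and chosen only for this example; they do not come from the exam material.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    tp: int  # fraudulent transactions correctly flagged
    fp: int  # legitimate transactions wrongly flagged
    fn: int  # fraudulent transactions missed
    tn: int  # legitimate transactions correctly passed


def metric_suite(c: ConfusionCounts) -> dict:
    """Return several complementary classification metrics at once."""
    precision = c.tp / (c.tp + c.fp) if (c.tp + c.fp) else 0.0
    recall = c.tp / (c.tp + c.fn) if (c.tp + c.fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    fpr = c.fp / (c.fp + c.tn) if (c.fp + c.tn) else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "false_positive_rate": fpr,
    }


def business_view(c: ConfusionCounts, avg_fraud_amount: float) -> dict:
    """Translate the same counts into KPI-style figures.

    avg_fraud_amount is an assumed average loss per fraudulent transaction.
    """
    return {
        "fraud_loss_prevented": c.tp * avg_fraud_amount,
        "fraud_loss_missed": c.fn * avg_fraud_amount,
        "legitimate_customers_flagged": c.fp,
    }


# Hypothetical evaluation window: 100,000 transactions, 100 of them fraudulent.
counts = ConfusionCounts(tp=80, fp=120, fn=20, tn=99_780)
metrics = metric_suite(counts)
kpis = business_view(counts, avg_fraud_amount=500.0)

for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
for name, value in kpis.items():
    print(f"{name}: {value}")
```

Reading the two dictionaries side by side shows the trade-off the explanation describes: recall drives the fraud loss prevented, while precision and false positive rate drive how many legitimate customers are inconvenienced.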

Why Incorrect

A. A single comprehensive metric is often insufficient and can obscure critical performance aspects, especially with imbalanced datasets typical in fraud detection.

C. Quarterly financial reports are lagging indicators influenced by numerous factors, making it difficult to isolate and accurately attribute performance directly to the AI system.

D. While valuable for governance, consulting external experts is a verification step for an evaluation, not the primary method for conducting the evaluation itself.
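The point made under A about imbalanced datasets can be checked with simple arithmetic. The sketch below assumes a hypothetical dataset of 100,000 transactions in which 0.1% are fraudulent and scores a "model" that never flags anything; the figures are invented purely for illustration.

```python
def accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Share of all transactions classified correctly."""
    return (tp + tn) / (tp + fp + fn + tn)


def recall(tp: int, fn: int) -> float:
    """Share of actual fraud cases that were caught."""
    return tp / (tp + fn) if (tp + fn) else 0.0


# Hypothetical data: 100,000 transactions, 100 fraudulent (0.1%).
# A model that never flags fraud produces these counts:
tp, fp, fn, tn = 0, 0, 100, 99_900

naive_accuracy = accuracy(tp, fp, fn, tn)
naive_recall = recall(tp, fn)

print(f"accuracy: {naive_accuracy:.3f}")
print(f"recall:   {naive_recall:.3f}")
```

Under these assumptions the do-nothing model is 99.9% accurate while catching no fraud at all, which is why a single headline number is a weak basis for judging the system against business KPIs.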

References

1. Kreuzberger, D., et al. (2023). Machine Learning Operations (MLOps): Overview, Definition, and Architecture. IEEE Access, 11, 31756-31775. In Section IV-A, "Model Evaluation," the authors emphasize that "evaluation of ML models is a multi-faceted problem" and that a variety of metrics and techniques are necessary for a comprehensive assessment, especially in continuous monitoring post-deployment. (DOI: https://doi.org/10.1109/ACCESS.2023.3262138)

2. Stanford University. (n.d.). CS229 Machine Learning Course Notes: Evaluation Metrics. The course materials discuss the importance of using metrics beyond simple accuracy for classification problems, particularly the precision-recall trade-off. For applications like fraud detection, evaluating this trade-off is critical to understanding business impact, which requires multiple evaluation points, not a single metric. (See discussion on Precision, Recall, and F1-score in the course's public materials).

3. Saleh, M. (2022). MLOps: The Ultimate Guide. Towards Data Science. While not a formal academic paper, this guide, widely referenced in data science curricula, explains that robust model evaluation in production (a core MLOps principle) involves a combination of techniques including A/B testing, canary deployments, and monitoring a dashboard of metrics, not just one. This aligns with the need for a diverse set of validation techniques.
