Question 12 - PMI-CPMAI Real Exam Questions [July 2026 Update]

Q: 12

A company plans to operationalize an AI solution. The project manager needs to ensure model performance is meeting selected thresholds before release. What is an effective way to confirm these thresholds before this release?

Options

Correct Answer: A

Explanation

Testing the trained model against a validation dataset is a standard step in the machine learning lifecycle. The validation dataset holds data that was not used to fit the model, so metrics computed on it estimate how the model will perform on unseen data rather than how well it memorized the training set. By calculating key performance metrics (e.g., accuracy, precision, F1-score) on this dataset, the project manager can compare the model's performance directly and quantitatively against the pre-defined acceptance thresholds, confirming that the model meets the required standards before it is released into production.
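As an illustration of the comparison described above, here is a minimal Python sketch of a pre-release threshold check. It is not part of the exam material: the function names (`binary_metrics`, `check_release_gate`), the sample labels, and the 0.75 thresholds are all hypothetical. It assumes a binary classifier whose validation predictions are already available.

```python
from typing import Dict, Sequence, Tuple


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall, and F1 for binary labels (1 = positive)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def check_release_gate(
    metrics: Dict[str, float], thresholds: Dict[str, float]
) -> Dict[str, Tuple[float, float]]:
    """Return {metric: (observed, required)} for every metric below its threshold.

    An empty result means every threshold was met.
    """
    return {
        name: (metrics[name], minimum)
        for name, minimum in thresholds.items()
        if metrics[name] < minimum
    }


if __name__ == "__main__":
    # Hypothetical validation labels and model predictions (illustrative only).
    y_val_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
    y_val_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

    # Hypothetical acceptance thresholds agreed on before evaluation.
    thresholds = {"accuracy": 0.75, "precision": 0.75, "f1": 0.75}

    metrics = binary_metrics(y_val_true, y_val_pred)
    failures = check_release_gate(metrics, thresholds)
    print("metrics:", metrics)
    print("release approved" if not failures else f"release blocked: {failures}")
```

Fixing the thresholds before the metrics are computed keeps the go/no-go decision objective. In practice a team would more likely use an established metrics library than hand-rolled functions; the stdlib-only version is used here only to keep the sketch self-contained.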

Why Incorrect

B. Implementing an impact evaluation assesses the broader, often post-deployment, effects of the solution, not the model's specific pre-release performance metrics.

C. Running multiple end-user acceptance tests focuses on system usability and meeting business requirements, not on the statistical validation of the model's performance.

D. Conducting a series of penetration tests is a security-focused activity to find system vulnerabilities, which is unrelated to measuring a model's predictive performance.

References

1. Ng, A. (2018). Machine Learning Yearning. Chapter 11: Splitting your data. This chapter explains the critical role of a development (or validation) set to "evaluate ideas" and provides an unbiased measure of model performance, which is essential for making decisions like shipping a product.

2. Google. (n.d.). Machine Learning Crash Course: Validation Set. Google Developers. Retrieved from https://developers.google.com/machine-learning/crash-course/validation/what-is-a-validation-set. This official documentation states, "The validation set is used to evaluate the model's performance during development... to check if the model is generalizing well to unseen data," which directly corresponds to confirming performance against thresholds.

3. Wirth, R., & Hipp, J. (2000). CRISP-DM: Towards a standard process model for data mining. Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, 29-39. The "Evaluation" phase of the CRISP-DM methodology (Section 2.5) explicitly details the task of "Evaluate Results," which involves assessing the model against business success criteria (i.e., performance thresholds) before proceeding to deployment.
