Q: 3
A company is building an ML model. The company collected new data and analyzed the data by
creating a correlation matrix, calculating statistics, and visualizing the data.
Which stage of the ML pipeline is the company currently in?
Options
Discussion
This looks very close to one I had on a mock exam. The correlation matrix part basically nails it as C.
Not B; it's C here. Calculating statistics and building correlation matrices is classic EDA, not feature engineering. B is the trap answer.
C, or B if "analyzed" also means creating new features. Would that count as feature engineering instead? Just want to be sure what they mean by "analyzed."
B tbh. Saw a similar scenario in the official practice test, so I'd check the AWS study guide and practice banks for how these stages are defined.
All those activities scream exploratory data analysis to me, so C is right. Making correlation matrices and visualizations isn't creating new features. Pretty sure this is what AWS expects.
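For anyone unsure where the line sits, here is a rough sketch of the distinction the thread is drawing. The dataset, column names, and the `pearson` helper are all made up for illustration and use only the Python standard library. The first part only describes existing columns (statistics, correlation matrix), which is what the question lists. The last part derives a new column, which is what feature engineering would look like.

```python
import math
import statistics

# Hypothetical toy dataset; names and values are invented for illustration.
data = {
    "sqft": [800, 1000, 1200, 1500, 1800],
    "rooms": [2, 3, 3, 4, 5],
    "price": [160, 200, 240, 300, 360],
}

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# EDA: describe the data as collected. No columns are added or changed.
summary = {
    name: {"mean": statistics.fmean(vals), "stdev": statistics.stdev(vals)}
    for name, vals in data.items()
}
corr = {a: {b: pearson(data[a], data[b]) for b in data} for a in data}

# Feature engineering, for contrast: derive a new input column.
features = dict(data)
features["price_per_sqft"] = [p / s for p, s in zip(data["price"], data["sqft"])]

for name, row in corr.items():
    print(name, {k: round(v, 3) for k, v in row.items()})
```

Nothing in the question's wording (correlation matrix, statistics, visualization) corresponds to that last step, which is why most replies here land on EDA.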
A similar question was in the official practice test. I recommend reviewing the AWS study guide for pipeline stages.