Its D, class imbalance is the main issue here. But if the question asked for the first thing you'd check rather than the most likely cause, missing features from option A could affect things too. The official guide and SageMaker docs cover class imbalance problems like this.
Q: 14
Scenario: A document classification model detects fraud. It performs well on the majority ("legitimate
claim") documents but frequently misclassifies the minority ("fraudulent claim") samples. SageMaker
Clarify pretraining bias analysis reveals a significant skew in the dataset.
Question- What issue is most likely causing the model's poor performance on fraudulent claim
detection?
Options:
Options
Discussion
AWS always asks this kind of class imbalance! D
A tbh. If Textract didn't get all the key fields out, the model would be missing important info to tell fraud from legit claims. The Clarify bias might just be reflecting what's missing in the features. Not 100 percent sure, anyone see it this way?
Probably D, classic case of class imbalance with way fewer fraud cases so the model just skews to the legit ones.
D imo, class imbalance fits best. The SageMaker Clarify bias report means the dataset is skewed, so the model just isn’t seeing enough fraudulent claims to learn their features. Overfitting or Textract issues would cause broader errors, not this specific minority miss. Not totally sure unless we see actual data stats, but this is textbook CI behavior.
Be respectful. No spam.
Question 14 of 15