Scenario: A data scientist needs to develop a fraud detection model on SageMaker with a severely imbalanced dataset (fraudulent transactions are rare). They must minimize operational overhead and ensure the model is fair and unbiased. Question- Which approach will fulfill the given requirements?. Options:
Pretty sure D is right here since Amazon Transcribe custom vocab lets you add/update product names fast, you don't have to retrain the whole model. The others seem more for general AI or search stuff? Not fully confident, let me know if I missed something.
C or B for me. Both mention using A2I for bias, which seems like it could help with detection, and they include SMOTE for balancing data. I know Clarify is more standard for bias checks but thought A2I handled some of that too? Not 100 percent convinced though, maybe missing something with Pipelines. Anyone else prefer A2I here or am I way off?