Question 5

Question

A financial company receives a high volume of real-time market data streams from an external
provider. The streams consist of thousands of JSON records every second.
The company needs to implement a scalable solution on AWS to identify anomalous data points.
Which solution will meet these requirements with the LEAST operational overhead?

Accepted Answer

Ingest real-time data into Amazon Kinesis data streams. Use the built-in RANDOM_CUT_FOREST
function in Amazon Managed Service for Apache Flink to process the data streams and to detect data
anomalies.

Sam H. · Answer

Makes sense to pick A here, since Flink has built-in anomaly detection and keeps operational work minimal. Anyone disagree with that approach?

SteadyLead4502 · Answer

C/D? Both use Lambda but D sets up for batch, not real-time. Since the question wants real-time anomaly detection with low ops, I’m pretty sure A is right over these. Anyone seeing something I missed in B or C?

Logan · Answer

A , built-in Random Cut Forest in Flink is the lowest ops here.

Liam C. · Answer

Why not B? It needs Lambda and SageMaker, so more ops than A.

Mason G. · Answer

Its D if you batch, but since you need real-time and as little ops work as possible, option A is built for this. The built-in RANDOM_CUT_FOREST in Flink means no custom ML or extra infra. Pretty sure that's what AWS wants here.

Premium Access Includes

FLASH OFFER

avail 10% DISCOUNT on YOUR PURCHASE