Q: 8
A company has implemented a data ingestion pipeline for sales transactions from its ecommerce
website. The company uses Amazon Data Firehose to ingest data into Amazon OpenSearch Service.
The buffer interval of the Firehose stream is set for 60 seconds. An OpenSearch linear model
generates real-time sales forecasts based on the data and presents the data in an OpenSearch
dashboard.
The company needs to optimize the data ingestion pipeline to support sub-second latency for the
real-time dashboard.
Which change to the architecture will meet these requirements?
Options
Discussion
Option A looks right. If you want sub-second latency, you can't wait out the 60-second buffer interval; Firehose needs to deliver records as they arrive. Setting the buffer interval to zero pushes data through immediately. Tuning the PutRecordBatch batch size helps with efficiency, but the main thing is removing that buffering delay. Pretty sure that's what AWS recommends for real-time use cases like this. Anyone see a downside?
It's A. You need to set the Firehose buffer interval to zero to get sub-second latency; otherwise your data sits in the buffer waiting. I've seen similar advice in the official AWS docs and practice questions. Anyone disagree?
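For anyone who wants to see what the two halves of that answer look like in practice, here is a rough sketch, not a definitive implementation. It assumes the producer sends JSON events through boto3's `put_record_batch`, and it relies on the documented PutRecordBatch limits of 500 records and 4 MiB per call. The helper names (`zero_buffering_hints`, `batch_records`, `send`), the `size_mb` default, and the event shape are made up for illustration. The hints dict is what you would pass as `BufferingHints` in the OpenSearch destination configuration.

```python
import json
from typing import Iterable, Iterator, List

# PutRecordBatch limits per call: up to 500 records and 4 MiB in total.
MAX_RECORDS_PER_BATCH = 500
MAX_BYTES_PER_BATCH = 4 * 1024 * 1024


def zero_buffering_hints(size_mb: int = 1) -> dict:
    """Buffering hints with the interval set to zero (hypothetical helper).

    size_mb=1 is an assumed value, not a recommendation.
    """
    return {"IntervalInSeconds": 0, "SizeInMBs": size_mb}


def batch_records(
    events: Iterable[dict],
    max_records: int = MAX_RECORDS_PER_BATCH,
    max_bytes: int = MAX_BYTES_PER_BATCH,
) -> Iterator[List[dict]]:
    """Group events into Records lists that respect the per-call limits."""
    batch: List[dict] = []
    batch_bytes = 0
    for event in events:
        data = json.dumps(event).encode("utf-8")
        # Start a new batch if adding this record would break either limit.
        if batch and (len(batch) >= max_records or batch_bytes + len(data) > max_bytes):
            yield batch
            batch, batch_bytes = [], 0
        batch.append({"Data": data})
        batch_bytes += len(data)
    if batch:
        yield batch


def send(events: Iterable[dict], stream_name: str, client) -> int:
    """Send events with one PutRecordBatch call per batch; returns the call count.

    `client` is expected to be boto3.client("firehose"). A real producer should
    also check FailedPutCount in each response and retry the failed records.
    """
    calls = 0
    for batch in batch_records(events):
        client.put_record_batch(DeliveryStreamName=stream_name, Records=batch)
        calls += 1
    return calls
```

Smaller batches mean each record waits less on the producer side before it is sent, at the cost of more API calls, so the batch size is the knob to tune once the Firehose buffer interval is no longer the bottleneck.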