Question 8

Question

A company has implemented a data ingestion pipeline for sales transactions from its ecommerce
website. The company uses Amazon Data Firehose to ingest data into Amazon OpenSearch Service.
The buffer interval of the Firehose stream is set for 60 seconds. An OpenSearch linear model
generates real-time sales forecasts based on the data and presents the data in an OpenSearch
dashboard.
The company needs to optimize the data ingestion pipeline to support sub-second latency for the
real-time dashboard.
Which change to the architecture will meet these requirements?

Accepted Answer

Use zero buffering in the Firehose stream. Tune the batch size that is used in the PutRecordBatch
operation.
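
A minimal producer-side sketch of the "tune the batch size" half of this answer, assuming Python with a boto3 Firehose client. Zero buffering itself is configured on the stream (a buffer interval of 0 seconds) and is not shown here. The helper names `chunk_records` and `send_batches`, the default of 50 records per call, and the stream name in the usage note are illustrative assumptions, not values from the question. The 500-record and 4 MiB caps are the documented per-call limits for PutRecordBatch.

```python
from typing import Iterable, Iterator, List

MAX_RECORDS_PER_CALL = 500              # PutRecordBatch cap on records per call
MAX_BYTES_PER_CALL = 4 * 1024 * 1024    # PutRecordBatch cap on payload per call (4 MiB)


def chunk_records(
    payloads: Iterable[bytes],
    max_records: int = 50,              # assumed tuning knob, not from the question
    max_bytes: int = MAX_BYTES_PER_CALL,
) -> Iterator[List[dict]]:
    """Group raw payloads into PutRecordBatch-shaped record lists."""
    if not 1 <= max_records <= MAX_RECORDS_PER_CALL:
        raise ValueError("max_records must be between 1 and 500")
    batch: List[dict] = []
    size = 0
    for data in payloads:
        # Start a new batch when the next record would break either limit.
        if batch and (len(batch) >= max_records or size + len(data) > max_bytes):
            yield batch
            batch, size = [], 0
        batch.append({"Data": data})
        size += len(data)
    if batch:
        yield batch


def send_batches(client, stream_name: str, payloads: Iterable[bytes],
                 max_records: int = 50) -> int:
    """Send every batch and return the total number of failed records."""
    failed = 0
    for batch in chunk_records(payloads, max_records):
        response = client.put_record_batch(
            DeliveryStreamName=stream_name,
            Records=batch,
        )
        # Partial failures are reported per call; a real producer would retry them.
        failed += response.get("FailedPutCount", 0)
    return failed
```

With a real client this might be called as `send_batches(boto3.client("firehose"), "sales-stream", payloads, max_records=50)`, where `sales-stream` is a hypothetical stream name. Smaller batches reduce the time a record waits in the producer before it is sent, at the cost of more API calls, so the right value depends on the traffic pattern.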

PracticalAnalyst7755 · Answer

Option A matches what I had in a mock. Setting the Firehose buffer interval to zero is the only way here to cut the batch delay, so you get almost instant updates in OpenSearch. None of the other options really get true sub-second latency unless you change the buffer itself. Pretty sure it's A, but lmk if you see it differently.

Ava W. · Answer

Option A looks right. If you want sub-second latency, you can't wait for the 60-second buffer; Firehose needs to send records as they come in. Setting buffering to zero pushes data through immediately. Tuning PutRecordBatch helps with efficiency, but the main thing is removing that delay. Pretty sure that's what AWS recommends for real-time use cases like this. Anyone see a downside?

Vikram W. · Answer

Option A. I've seen this logic in a few official practice questions and in the AWS docs too.

Chris U. · Answer

Honestly AWS makes this so annoying, it's always buffer tweak questions. A

Mia Y. · Answer

B tbh, DataSync with enhanced fan-out sounds like faster parallel processing to me. Option A feels like a buffer config trap.
