Question 1

A data engineer has configured a Structured Streaming job to read from a table, manipulate the data,
and then perform a streaming write into a new table.
The code block used by the data engineer is below:
https://kxbjsyuhceggsyvxdkof.supabase.co/storage/v1/object/public/file-images/DATABRICKS-CERTIFIED-DATA-ENGINEER-ASSOCIATE/page_48_img_1.jpg
If the data engineer only wants the query to process all of the available data in as many batches as
required, which of the following lines of code should the data engineer use to fill in the blank?

Accepted Answer

trigger(availableNow=True)
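
For reference, here is a minimal PySpark sketch of where the accepted trigger sits in a streaming write. The original code block is only available as an image, so this is not a reproduction of it: the function name `write_available_now`, its parameters, and the append output mode are all assumptions chosen for illustration. Note that `trigger(availableNow=True)` needs Spark 3.3 or later.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:  # only needed for type hints, not at runtime
    from pyspark.sql import DataFrame
    from pyspark.sql.streaming import StreamingQuery


def write_available_now(
    stream_df: "DataFrame",
    checkpoint_path: str,
    target_table: str,
) -> "StreamingQuery":
    """Start a streaming write that drains the current backlog, then stops.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    return (
        stream_df.writeStream
        .outputMode("append")  # assumed output mode for this sketch
        .option("checkpointLocation", checkpoint_path)
        # Process everything available at start time, split into as many
        # micro-batches as needed, then stop the query.
        .trigger(availableNow=True)
        .toTable(target_table)
    )
```

A hypothetical call would look like `write_available_now(spark.readStream.table("sales"), "/tmp/checkpoints/new_sales", "new_sales")`, where the table names and checkpoint path are placeholders. For comparison, `trigger(once=True)` processes the available data in a single batch, and `processingTime` expects an interval string such as `"10 seconds"`.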

CameronJ · Answer

Makes sense to pick B here. trigger(availableNow=True) is meant for processing all available data in multiple batches if needed.

Sofia L. · Answer

Ugh, Databricks changing syntax again. Option B

AvaX · Answer

I'd actually pick D here. In my experience, trigger(processingTime="once") will process all the currently available data in one go and then stop, which feels like what they're asking for. Could be wrong if they're expecting multiple batches though. Anyone thinking the same?

Zoe X. · Answer

D

Anita N. · Answer

B is right here since trigger(availableNow=True) makes the job process all existing data in as many batches as needed, which matches the requirement. Official docs and practice tests both highlight this option. Pretty sure, but let me know if you see it differently.
