Q: 13
Your team is building a data lake platform on Google Cloud. As a part of the data foundation design,
you are planning to store all the raw data in Cloud Storage You are expecting to ingest approximately
25 GB of data a day and your billing department is worried about the increasing cost of storing old
dat
a. The current business requirements are:
• The old data can be deleted anytime
• You plan to use the visualization layer for current and historical reporting
• The old data should be available instantly when accessed
• There should not be any charges for data retrieval.
What should you do to optimize for cost?
Options
Discussion
Had something like this in a mock. B and C are right since policy tags need to be set up with the right access controls, and you don't want analytics folks having the Fine-Grained Reader role for sensitive columns. Always double check the Data Catalog permissions too. Pretty sure that's it but open to other takes if I missed anything.
Nah, I think B and C. D's a trap since removing dataViewer isn't enough if policy tags aren't enforced.
Probably B and C, matches what I saw in similar practice sets. Super clear options here.
Be respectful. No spam.