Q: 11
You need to choose a database for a new project that has the following requirements:
Fully managed
Able to automatically scale up
Transactionally consistent
Able to scale up to 6 TB
Able to be queried using SQL
Which database do you choose?
Options
Discussion
C imo. Spanner ticks all those boxes, especially scalability and strong transaction consistency. Not 100 percent sure but fits the requirements best.
Be respectful. No spam.
Q: 12
Your company has recently grown rapidly and is now ingesting data at a significantly higher rate than it
was previously. You manage the daily batch MapReduce analytics jobs in Apache Hadoop. However,
the recent increase in data has meant the batch jobs are falling behind. You were asked to
recommend ways the development team could increase the responsiveness of the analytics without
increasing costs. What should you recommend they do?
Options
Discussion
Q: 13
You have a BigQuery table that contains customer data, including sensitive information such as
names and addresses. You need to share the customer data with your data analytics and consumer
support teams securely. The data analytics team needs to access the data of all the customers, but
must not be able to access the sensitive data. The consumer support team needs access to all data
columns, but must not be able to access customers that no longer have active contracts. You
enforced these requirements by using an authorized dataset and policy tags. After implementing
these steps, the data analytics team reports that they still have access to the sensitive columns.
You need to ensure that the data analytics team does not have access to restricted data. What
should you do?
Choose 2 answers
Options
Discussion
Probably B and C, matches what I saw in similar practice sets. Super clear options here.
Q: 14
Which of the following IAM roles does your Compute Engine account require to be able to run
pipeline jobs?
Options
Discussion
Is the question asking specifically about running jobs on Dataflow, or just any pipeline job in general? If it’s about Dataflow pipelines, A is best, but a different service might need another role.
Q: 15
You are migrating an application that tracks library books and information about each book, such as
author or year published, from an on-premises data warehouse to BigQuery. In your current
relational database, the author information is kept in a separate table and joined to the book
information on a common key. Based on Google's recommended practice for schema design, how
would you structure the data to ensure optimal speed of queries about the author of each book that
has been borrowed?
Options
Discussion
Probably C
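For anyone unsure what the question is getting at: as I understand BigQuery's schema guidance, it generally favors denormalizing (for example with nested fields) over joining separate tables at query time. Below is a minimal pure-Python sketch of that idea, not BigQuery code. The function name and the field names (`book_id`, `author_id`, `name`, and so on) are hypothetical, made up for illustration.

```python
def denormalize(books, authors):
    """Embed each book's author record directly in the book row.

    `books` and `authors` are lists of dicts that share an `author_id` key.
    All field names here are assumptions made for this illustration.
    """
    authors_by_id = {a["author_id"]: a for a in authors}
    rows = []
    for book in books:
        author = authors_by_id.get(book["author_id"])
        # Copy the book fields, dropping the join key that is no longer needed.
        row = {k: v for k, v in book.items() if k != "author_id"}
        # Nested record: a query about the author reads it from the same row,
        # so no join against a second table is required.
        row["author"] = {"name": author["name"]} if author is not None else None
        rows.append(row)
    return rows


books = [{"book_id": 1, "title": "Dune", "year_published": 1965, "author_id": 10}]
authors = [{"author_id": 10, "name": "Frank Herbert"}]
denormalized_rows = denormalize(books, authors)
```

The trade-off is repeated author data across rows in exchange for skipping the join on every read.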
Q: 16
An aerospace company uses a proprietary data format to store its flight data. You need to connect
this new data source to BigQuery and stream the data into BigQuery. You want to efficiently import
the data into BigQuery while consuming as few resources as possible.
What should you do?
Options
Discussion
Q: 17
You need ads data to serve AI models and historical data for analytics. Longtail and outlier data points
need to be identified. You want to cleanse the data in near-real time before running it through AI
models. What should you do?
Options
Discussion
Seen similar on practice exams. Probably A.
Q: 18
You want to build a managed Hadoop system as your data lake. The data transformation process is
composed of a series of Hadoop jobs executed in sequence. To accomplish the design of separating
storage from compute, you decided to use the Cloud Storage connector to store all input data,
output data, and intermediary data. However, you noticed that one Hadoop job runs very slowly with
Cloud Dataproc, when compared with the on-premises bare-metal Hadoop environment (8-core nodes
with 100-GB RAM). Analysis shows that this particular Hadoop job is disk I/O intensive. You want to
resolve the issue. What should you do?
Options
Discussion
Option A. Similar questions in the GCP practice exams highlight in-memory processing as key for performance issues.
Q: 19
Each analytics team in your organization is running BigQuery jobs in their own projects. You want to
enable each team to monitor slot usage within their projects. What should you do?
Options
Discussion
Does the question specify if an organization-level view is required, or just within each project? That detail would change the best choice.
Q: 20
You are designing a pipeline that publishes application events to a Pub/Sub topic. You need to
aggregate events across hourly intervals before loading the results to BigQuery for analysis. Your
solution must be scalable so it can process and load large volumes of events to BigQuery. What
should you do?
Options
Discussion
Streaming aggregation is the scalable way here, so A.
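To make "aggregate events across hourly intervals" concrete, here is a minimal pure-Python sketch of fixed (tumbling) one-hour windows. It is not Pub/Sub or BigQuery code, and it ignores late or out-of-order data, which a real streaming engine would have to handle. The function names, the `(epoch_seconds, event_type)` tuple layout, and the count aggregate are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime, timezone

WINDOW_SECONDS = 3600  # one-hour fixed (tumbling) windows


def window_start(ts):
    """Return the epoch second at which the event's hourly window begins."""
    return int(ts // WINDOW_SECONDS) * WINDOW_SECONDS


def aggregate_hourly(events):
    """Count events per (window start, event type).

    `events` is an iterable of (epoch_seconds, event_type) pairs; this layout
    is hypothetical and stands in for decoded messages from a topic.
    """
    counts = defaultdict(int)
    for ts, event_type in events:
        counts[(window_start(ts), event_type)] += 1
    return dict(counts)


def to_rows(counts):
    """Shape the aggregates as row dicts, one per window and event type."""
    return [
        {
            "window_start": datetime.fromtimestamp(start, tz=timezone.utc).isoformat(),
            "event_type": event_type,
            "event_count": n,
        }
        for (start, event_type), n in sorted(counts.items())
    ]


sample_events = [(0, "click"), (3599, "click"), (3600, "click"), (7199.5, "view")]
hourly_counts = aggregate_hourly(sample_events)
hourly_rows = to_rows(hourly_counts)
```

Each event lands in exactly one window, so only one small row per window and event type needs to be loaded, rather than every raw event.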