Q: 4
A company uses an Amazon Redshift provisioned cluster as its database. The Redshift cluster has five
reserved ra3.4xlarge nodes and uses key distribution.
A data engineer notices that one of the nodes frequently has a CPU load over 90%. SQL Queries that
run on the node are queued. The other four nodes usually have a CPU load under 15% during daily
operations.
The data engineer wants to maintain the current number of compute nodes. The data engineer also
wants to balance the load more evenly across all five compute nodes.
Which solution will meet these requirements?
Options
Discussion
Option B looks right. If one node is overloaded and others are mostly idle, that's usually a sign the distribution key isn't set well and data isn't spread out. Picking a column with high cardinality should make things more balanced across nodes. Pretty sure, but open to other ideas.
Be respectful. No spam.