Question 3

Question

You are tasked with contributing to the operations of an AI data center that requires high availability
and minimal downtime. Which strategy would most effectively help maintain continuous AI
operations in collaboration with the data center administrator?

Accepted Answer

Use GPUs in active-passive clusters, with DPUs handling real-time network failover and security

Avery W. · Answer

Makes sense to me, option C. Active-passive GPU with DPU-managed failover is exactly how you'd architect HA for AI workloads, at least from everything I've seen.

Reese · Answer

I remember a similar scenario from labs, in some exam reports, and it's C. This matches what NVIDIA recommends for high availability AI ops.

Jason C. · Answer

A is wrong, C. DPU handles network/security, not inference jobs, and CPUs can't really match GPU workloads for HA AI ops. Active-passive GPU clusters are pretty much how NVIDIA does high availability now.

CuriousEngineer2748 · Answer

C . GPUs in active-passive clusters plus DPU network failover is standard for minimum downtime.

CuriousLead7685 · Answer

Option C, Had something like this in a mock, GPU active-passive with DPU handling network failover is the standard HA setup for AI these days. Pretty sure that's what they want.

Premium Access Includes

FLASH OFFER

avail 10% DISCOUNT on YOUR PURCHASE