NVIDIA NCP AII.pdf
Q: 1
A system administrator needs to install a container toolkit and successfully run the following
commands:
sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit
sudo nvidia-ctk runtime configure --runtime docker
What step should be taken next to finish the installation?
Options
Q: 2
An administrator is configuring node categories in BCM for a DGX BasePOD cluster. They need to
group all NVIDIA DGX H200 nodes under a dedicated category for GPU-accelerated workloads. Which
approach aligns with NVIDIA's recommended BCM practices?
Options
Q: 3
A financial services firm is deploying an AI model for fraud detection that requires rapid inference
and data retrieval across multiple sites. Which feature should their storage system prioritize?
Options
Q: 4
You are leading a project to enhance the energy efficiency of a data center that heavily relies on AI
workloads. NVIDIA suggests moving beyond traditional metrics like Power Usage Effectiveness (PUE)
to better capture the efficiency of modern data centers. Which strategy should you prioritize?
Options
Q: 5
An InfiniBand server stops working, and a system administrator runs the "ibstat" command that
provides the following output:
CA 'mlx5_1'
CA type: MT4115
Number of ports: 2
Firmware version: 10.20.1010
Hardware version: 0
Node GUID: 0x0002c90300002f78
System image GUID: 0x0002c90300002f7b
Port 1:
State: Initializing
Physical state: Linkup
Rate: 100
Base lid: 0
LMC: 0
SM lid: 0
Capability mask: 0x0251086a
Port GUID: 0x0002c90300002f79
Link layer: InfiniBand
What is the cause of the issue?
Options
Q: 6
A customer is designing an AI Factory for enterprise-scale deployments and wants to ensure
redundancy and load balancing for the management and storage networks. Which feature should be
implemented on the Ethernet switches?
Options
Q: 7
You are installing the operating system as part of the initial setup for a new NVIDIA Base Command
Manager (BCM) cluster. Which two of the following actions are essential for a successful OS
installation on the cluster's head node? (Pick the 2 correct responses below)
Options
Q: 8
A systems administrator is preparing a new DGX server for deployment. What is the most secure
approach to configuring the BMC port during initial setup?
Options
Q: 9
For a 48-hour NCCL burn-in test, which parameters ensure sustained fabric stress while detecting
silent data corruption?
Options
Q: 10
A system administrator needs to validate a GPU-based server and ensure that no errors occur under
load. What command should be used?
Options
Question 1 of 10