Does the question specify whether "mlx5_X" is just an example, or are we supposed to check all InfiniBand devices? If it wants "the best" option for checking multiple interfaces at once, B could be tempting, but D directly shows state and link info per interface.
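If anyone wants to see what "state and link info per interface" looks like outside the exam wording, here's a rough sketch that reads the Linux sysfs entries under /sys/class/infiniband. It isn't tied to any of the answer options; the function name ib_port_states and the choice of the three attributes are my own assumptions for illustration.

```python
from pathlib import Path


def ib_port_states(root="/sys/class/infiniband"):
    """Collect state, phys_state and rate for every port of every IB device.

    Hypothetical helper for illustration. Returns a dict keyed by
    (device_name, port_number) with the raw sysfs strings as values.
    """
    results = {}
    base = Path(root)
    if not base.is_dir():
        # No InfiniBand devices exposed (or not a Linux host).
        return results
    for dev in sorted(base.iterdir()):
        ports = dev / "ports"
        if not ports.is_dir():
            continue
        for port in sorted(ports.iterdir()):
            info = {}
            for attr in ("state", "phys_state", "rate"):
                attr_file = port / attr
                # Missing attributes are recorded as None rather than guessed.
                info[attr] = attr_file.read_text().strip() if attr_file.is_file() else None
            results[(dev.name, port.name)] = info
    return results


if __name__ == "__main__":
    for (dev, port), info in ib_port_states().items():
        print(dev, port, info)
```

Running it on a host with no InfiniBand devices just prints nothing, so it's safe to try anywhere.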
I don’t think C is right. "Local" in LRH refers to the entire subnet, not just a single link. A fits because it’s about routing traffic between nodes inside the local subnet. Pretty sure this matches both exam wording and InfiniBand docs, but open to correction if there’s an angle I missed. B might trip people up since LIDs come from the subnet manager, but that’s not what makes it a local header.
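To back up the point that the LRH only carries subnet-scoped addressing, here's a minimal sketch that unpacks the 8-byte Local Route Header as I understand the layout from the InfiniBand spec (VL, LVer, SL, LNH, DLID, PktLen, SLID). The function name parse_lrh and the sample bytes are made up for illustration, so treat it as a study aid and not a reference decoder.

```python
import struct


def parse_lrh(raw: bytes) -> dict:
    """Decode the first 8 bytes of a packet as an InfiniBand LRH.

    Hypothetical helper; field layout assumed from the IBA spec.
    """
    if len(raw) < 8:
        raise ValueError("LRH needs at least 8 bytes")
    # Big-endian: two single bytes, then three 16-bit words.
    b0, b1, dlid, len_word, slid = struct.unpack(">BBHHH", raw[:8])
    return {
        "vl": b0 >> 4,            # virtual lane
        "lver": b0 & 0x0F,        # link version
        "sl": b1 >> 4,            # service level
        "lnh": b1 & 0x03,         # link next header (what follows the LRH)
        "dlid": dlid,             # 16-bit destination LID, assigned within the subnet
        "pkt_len": len_word & 0x07FF,  # 11-bit packet length field (top 5 bits reserved)
        "slid": slid,             # 16-bit source LID, assigned within the subnet
    }


if __name__ == "__main__":
    # Made-up sample header: DLID 5, SLID 3.
    sample = bytes([0x00, 0x02, 0x00, 0x05, 0x00, 0x10, 0x00, 0x03])
    print(parse_lrh(sample))
```

Notice the only addresses in there are the two 16-bit LIDs. Anything that has to cross subnets needs a GRH on top, which is why "local" lines up with the subnet and not with a single link.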
Yeah, D makes sense here. DGX SuperPOD is the actual reference architecture for massive AI/HPC clusters on-prem, while LaunchPad (C) is just for quick hands-on labs. Pretty sure SuperPOD is what NVIDIA recommends for these research setups. Anyone see a reason to consider B instead?
Probably D, since Spectrum-X tracks egress queue loads to spot congestion, not CPU usage or device count. Queue depth basically reflects whether traffic is backing up, which is what you'd want to monitor for real-time congestion. Pretty sure that's how their switches do it, but happy to hear different takes.
Yep, it's D for sure. The official guide and Spectrum-X config labs always mention checking egress queue loads to see congestion, not device count or CPU usage. If you've done the practice exams, this wording comes up a lot. Anyone think another option fits better?
Pretty sure it's B, since ignore-files is all about letting you manage certain config via both NVUE and standard flat files without overwriting changes. D sounds security-related, but that's not the main goal here. Anyone think differently?