1. NVIDIA GPUDirect RDMA Documentation. Sections on performance tuning and validation often recommend using large message sizes to saturate network bandwidth for testing.
2. Ohio State University (OSU) Micro-Benchmarks User Guide. While a different suite, the principles for network stress testing are universal: use large messages for bandwidth tests and long run times for stability tests. The NCCL tests are based on these principles.
3. NVIDIA. (2022). NCCL Tests. The source code and documentation for the nccl-tests repository (e.g., on GitHub) show options for message size (-b, -e), iterations (-n), and data checking (-c), confirming their roles in stress testing.