NVIDIA-Certified Professional AI Networking Questions and Answers
You are troubleshooting an InfiniBand network issue and need to check the status of the InfiniBand interfaces. Which command should you use to display the state, physical state, and link layer of InfiniBand interfaces?
A financial services company is planning to implement an AI infrastructure to support real-time fraud detection and risk assessment. They need a solution that can handle both training and inference workloads while maintaining data privacy and security.
Which NVIDIA reference architecture component would be most appropriate to address the data privacy and security concerns in this AI networking setup?
When creating a simulation in NVIDIA AIR, what syntax would you use to define a link between port 1 on spine-01 and port 41 on gpu-leaf-01?
What does NetQ leverage (in addition to NVIDIA "What Just Happened" switch telemetry data and NVIDIA DOCA telemetry) to help network operators proactively identify server and application root cause issues?
You are planning to deploy a large-scale Spectrum-X network for AI workloads. Before physical implementation, you want to validate the network design and configuration using a digital twin approach.
Which NVIDIA tool would be most appropriate for creating and simulating a digital twin of your Spectrum-X network?
How is congestion evaluated in an NVIDIA Spectrum-X system?
You suspect there might be connectivity issues in your InfiniBand fabric and need to perform a comprehensive check. Which tool should you use to run a full fabric diagnostic and generate a report?
You are deploying a Kubernetes cluster for AI workloads using NVIDIA Spectrum-X switches. You need to automate the deployment and management of networking components in this environment.
Which NVIDIA tool is specifically designed to automate the deployment and management of networking components in a Kubernetes cluster with Spectrum-X switches?
You are troubleshooting InfiniBand connectivity issues in a cluster managed by the NVIDIA Network Operator. You need to verify the status of the InfiniBand interfaces. Which command should you use to check the state and link layer of InfiniBand interfaces on a node?
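As a rough illustration of what such a check looks at, the sketch below parses text laid out like the per-port report printed by the `ibstat` utility and extracts the state, physical state, and link layer of each port. The sample text, the device name `mlx5_0`, and the helper `parse_port_status` are made up for illustration; real output carries more fields and may differ between driver versions.

```python
import re

# Hypothetical sample shaped like `ibstat` output; not captured from a real node.
SAMPLE = """\
CA 'mlx5_0'
    CA type: MT4123
    Number of ports: 1
    Port 1:
        State: Active
        Physical state: LinkUp
        Rate: 200
        Base lid: 5
        Link layer: InfiniBand
"""

# The three per-port fields the question asks about.
FIELDS = ("State", "Physical state", "Link layer")


def parse_port_status(text):
    """Return {(ca_name, port_number): {field: value}} for the fields in FIELDS."""
    ports = {}
    ca = None
    port = None
    for raw in text.splitlines():
        line = raw.strip()
        ca_match = re.match(r"CA '([^']+)'", line)
        if ca_match:
            # A new adapter section starts; no port is selected yet.
            ca, port = ca_match.group(1), None
            continue
        port_match = re.match(r"Port (\d+):$", line)
        if port_match:
            port = int(port_match.group(1))
            ports[(ca, port)] = {}
            continue
        if port is not None and ":" in line:
            key, _, value = line.partition(":")
            if key in FIELDS:
                ports[(ca, port)][key] = value.strip()
    return ports


if __name__ == "__main__":
    for (ca_name, port_number), info in parse_port_status(SAMPLE).items():
        print(ca_name, port_number, info)
```

A port that is cabled and has been brought up by the subnet manager would normally report `Active` and `LinkUp`; any other combination is a starting point for further checks.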
A fabric administrator added new servers to a 40-port edge switch. The administrator now needs to gather and map the LIDs and link speeds of the newly added ports. Which of the following commands can be used for that purpose?
You are troubleshooting connectivity issues in your InfiniBand network and need to test basic connectivity between nodes. Which command should you use to test basic connectivity between InfiniBand nodes?
You are tasked with troubleshooting a link flapping issue in an InfiniBand AI fabric. You would like to start troubleshooting from the physical layer.
Which NVIDIA tool is the right one for this task?
Which of the following NCCL environment variables enable SHARP aggregation with NCCL when using the NCCL-SHARP plugin?
Pick the two correct responses.
In order to configure RoCE on a Cumulus switch, which command should be used?
You are concerned about potential security threats and unexpected downtime in your InfiniBand data center.
Which UFM platform uses analytics to detect security threats, operational issues, and predict network failures in InfiniBand data centers?
A cloud service provider is deploying the NVIDIA Spectrum-X Ethernet platform in a multi-tenant environment. To ensure the security and isolation of each tenant's AI workload, the provider wants to implement a feature that prevents unauthorized access to the network.
Which of the following features of the Spectrum-X platform should the provider implement?
Which command should be used to configure RoCE on a Cumulus switch?
A high-performance InfiniBand fabric requires a routing engine that maximizes throughput and network utilization while reducing congestion. Which option below is the best routing engine for InfiniBand?
A user has requested confirmation that the InfiniBand network is performing optimally and is not limiting the speed of a training run. To verify this, you would like to measure the RDMA throughput rate between two endpoints.
Which tool should be used?
You are implementing a multi-tenant environment on your Spectrum-X switches for different departments in your organization. You need to ensure that each department's network traffic is isolated and secure.
Which Spectrum-X security feature would be most effective in creating isolated network environments for each department?
Which of the following commands would you use to assign the IP address 20.11.12.13 to the management interface in SONiC?
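One way to sanity-check the address before applying it to a switch is sketched below. It validates the address with Python's `ipaddress` module and assembles a command string in the `config interface ip add` form described in the SONiC command reference. The question gives no prefix length or gateway, so the `/24` prefix, the `20.11.12.254` gateway, the management port name `eth0`, and the helper `build_mgmt_ip_command` are assumptions made for illustration.

```python
import ipaddress


def build_mgmt_ip_command(address, prefix_len, gateway=None, interface="eth0"):
    """Build a SONiC-style command string for a management interface address.

    Raises ValueError if the address or gateway is malformed, or if the
    gateway lies outside the subnet implied by address/prefix_len.
    """
    iface = ipaddress.ip_interface(f"{address}/{prefix_len}")
    parts = ["sudo", "config", "interface", "ip", "add", interface, iface.with_prefixlen]
    if gateway is not None:
        gw = ipaddress.ip_address(gateway)
        if gw not in iface.network:
            raise ValueError(f"gateway {gw} is not inside {iface.network}")
        parts.append(str(gw))
    return " ".join(parts)


if __name__ == "__main__":
    # Prefix length and gateway are assumed values, not given in the question.
    print(build_mgmt_ip_command("20.11.12.13", 24, gateway="20.11.12.254"))
```

The helper only builds and checks the string; it does not run anything on a switch, so it is safe to try on any workstation.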