Exam NCA-AIIO Topic 2 Question 29 Discussion

Actual exam question for NVIDIA's NCA-AIIO exam
Question #: 29
Topic #: 2
You are responsible for managing an AI infrastructure that runs a critical deep learning application. The application experiences intermittent performance drops, especially when processing large datasets. Upon investigation, you find that some of the GPUs are not being fully utilized while others are overloaded, causing the overall system to underperform. What would be the most effective solution to address the uneven GPU utilization and optimize the performance of the deep learning application?

Suggested Answer: D Vote an answer

Intermittent performance drops due to uneven GPU utilization stem from workload imbalance. Dynamic load balancing, enabled by NVIDIA tools like Triton Inference Server or Kubernetes with GPU Operator, redistributes tasks based on GPU utilization, ensuring even processing of large datasets. This optimizes performance in DGX or multi-GPU setups by preventing overload and underuse, directly addressing the root cause.
Reducing dataset size (Option A) compromises model quality and doesn't fix distribution. Increasing clock speed (Option B) may help overloaded GPUs but not underutilized ones. Adding GPUs (Option C) increases capacity but not balance. NVIDIA's infrastructure solutions favor dynamic balancing for critical applications.

by Daphne at Jun 09, 2026, 08:49 PM

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Nick name: Submit Cancel
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

0
0
0
10