Exam NCP-AII Topic 1 Question 110 Discussion

Actual exam question for NVIDIA's NCP-AII exam
Question #: 110
Topic #: 1
You have an NVIDIA A100 GPU and need to configure it for optimal performance across two distinct AI workloads: a large language model (LLM) training job and a computer vision inference service. The LLM benefits from maximum memory bandwidth, while the inference service requires low latency and high throughput. Which MIG configuration would best suit this scenario?

Suggested Answer: D

Creating a log. 120gb instance for the memory-intensive LLM and a 4g.40gb instance for the inference service gives each workload dedicated resources matched to its needs, without the overhead or limitations of CUDA MPS or Kubernetes resource quotas. Option A is too conservative and could limit LLM performance. Option B gives up dedicated resources for inference, which may hurt latency. Option C does not use MIG, so it guarantees neither resource isolation nor consistent performance. Option E adds the complexity of time-slicing and might not be suitable for real-time processing.
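Whether a given pair of MIG instances can coexist on one GPU comes down to a slice budget. The sketch below is a minimal illustration, assuming the 80 GB variant of the A100 (7 compute slices, 8 memory slices), which is the variant on which a 4g.40gb profile exists. The profile table is partial, the function name `fits_on_gpu` is hypothetical, and pairing 3g.40gb with 4g.40gb is an illustrative assumption rather than something taken from the answer options, which are not shown here. Real MIG placement also has positional constraints that this budget check ignores.

```python
# Slice cost of some MIG profiles on an A100 80GB (assumed variant):
# profile name -> (compute slices, memory slices). Partial table.
A100_80GB_PROFILES = {
    "1g.10gb": (1, 1),
    "2g.20gb": (2, 2),
    "3g.40gb": (3, 4),
    "4g.40gb": (4, 4),
    "7g.80gb": (7, 8),
}

MAX_COMPUTE_SLICES = 7  # compute slices available on one A100
MAX_MEMORY_SLICES = 8   # memory slices available on one A100


def fits_on_gpu(profiles):
    """Return True if the requested profiles stay within the slice budget.

    This checks totals only; it does not model MIG placement rules.
    """
    compute_used = 0
    memory_used = 0
    for name in profiles:
        if name not in A100_80GB_PROFILES:
            raise ValueError(f"unknown profile: {name}")
        compute, memory = A100_80GB_PROFILES[name]
        compute_used += compute
        memory_used += memory
    return compute_used <= MAX_COMPUTE_SLICES and memory_used <= MAX_MEMORY_SLICES


if __name__ == "__main__":
    # One larger and one smaller instance, each with dedicated resources.
    print(fits_on_gpu(["3g.40gb", "4g.40gb"]))
    # Two 4g instances would need 8 compute slices, one more than exists.
    print(fits_on_gpu(["4g.40gb", "4g.40gb"]))
```

Once a combination passes a check like this, the instances themselves would typically be created with `nvidia-smi mig -cgi <profiles> -C` after MIG mode has been enabled on the GPU.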

by Leona at Feb 11, 2026, 03:00 AM
