Exam NCA-GENL Topic 9 Question 27 Discussion
Actual exam question for NVIDIA's NCA-GENL exam
Question #: 27
Topic #: 9
Question #: 27
Topic #: 9
Which of the following is a parameter-efficient fine-tuning approach that one can use to fine-tune LLMs in a memory-efficient fashion?
Suggested Answer: D Vote an answer
LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning approach specifically designed for large language models (LLMs), as covered in NVIDIA's Generative AI and LLMs course. It fine-tunes LLMs by updating a small subset of parameters through low-rank matrix factorization, significantly reducing memory and computational requirements compared to full fine-tuning. This makes LoRA ideal for adapting large models to specific tasks while maintaining efficiency. Option A, TensorRT, is incorrect, as it is an inference optimization library, not a fine-tuning method. Option B, NeMo, is a framework for building AI models, not a specific fine-tuning technique. Option C, Chinchilla, is a model, not a fine-tuning approach. The course emphasizes: "Parameter-efficient fine-tuning methods like LoRA enable memory-efficient adaptation of LLMs by updating low-rank approximations of weight matrices, reducing resource demands while maintaining performance." References: NVIDIA Building Transformer-Based Natural Language Processing Applications course; NVIDIA Introduction to Transformer-Based Natural Language Processing.
by Wendell at Jan 28, 2026, 02:54 AM
0
0
0
10
Comments
Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.
Report Comment
Commenting
You can sign-up / login (it's free).