Exam Databricks-Machine-Learning-Associate Topic 1 Question 41 Discussion

Actual exam question for Databricks's Databricks-Machine-Learning-Associate exam
Question #: 41
Topic #: 1
In which of the following situations is it preferable to impute missing feature values with their median value over the mean value?

Suggested Answer: C

Median imputation is preferred over mean imputation when the feature contains many extreme outliers. The median is a more robust measure of central tendency: a few extreme values can pull the mean far from the bulk of the data, while the median barely moves. Imputing with the median therefore yields fill values that are closer to a typical data point and distorts the feature's distribution less. The other options do not bear on how outliers affect the imputed value.
Reference:
Data Imputation Techniques (Dealing with Outliers).
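
To make the explanation above concrete, here is a minimal sketch in plain Python. The `incomes` column and the `impute` helper are hypothetical, made up for illustration; they are not part of the exam question. The sketch only shows how a single extreme value shifts the mean fill value while leaving the median fill value near the typical data.

```python
from statistics import mean, median

# Hypothetical feature column (assumption for illustration):
# typical values near 50, one extreme outlier, two missing entries.
incomes = [48.0, 52.0, None, 50.0, 47.0, 1000.0, None, 53.0]


def impute(values, fill):
    """Return a copy of values with every None replaced by fill."""
    return [fill if v is None else v for v in values]


# Compute both statistics on the observed (non-missing) values only.
observed = [v for v in incomes if v is not None]
mean_fill = mean(observed)
median_fill = median(observed)

mean_imputed = impute(incomes, mean_fill)
median_imputed = impute(incomes, median_fill)

print(f"mean fill:   {mean_fill:.2f}")    # 208.33, pulled up by the outlier
print(f"median fill: {median_fill:.2f}")  # 51.00, close to the typical values
```

On Databricks the same choice would typically be made through Spark ML's `pyspark.ml.feature.Imputer`, whose `strategy` parameter accepts `"mean"` or `"median"`; the plain-Python version is used here only to keep the example self-contained.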

by Ula at May 22, 2025, 04:03 AM
