Exam Databricks-Machine-Learning-Associate Topic 1 Question 41 Discussion
Actual exam question for Databricks's Databricks-Machine-Learning-Associate exam
Question #: 41
Topic #: 1
In which of the following situations is it preferable to impute missing feature values with their median value over the mean value?
Suggested Answer: C
Median imputation is generally preferable to mean imputation when the feature contains many extreme outliers. The mean is pulled toward extreme values, so a handful of outliers can shift it far from where most of the data lies. The median depends only on the ordering of the values, which makes it a more robust measure of central tendency. Imputing with the median therefore yields fill values that are closer to a typical data point and distorts the feature's distribution less. The other options do not bear on this sensitivity to outliers in numerical data.
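To make the outlier effect concrete, here is a minimal sketch in plain Python. The `impute` helper and the `incomes` values are hypothetical and exist only for illustration; they are not part of the exam question or of any Databricks API.

```python
from statistics import mean, median

def impute(values, strategy):
    """Fill None entries with the mean or median of the observed values.

    Hypothetical helper for illustration only.
    Returns the imputed list and the fill value that was used.
    """
    observed = [v for v in values if v is not None]
    fill = mean(observed) if strategy == "mean" else median(observed)
    return [fill if v is None else v for v in values], fill

# Made-up feature: typical values near 50, one extreme outlier, one missing entry.
incomes = [48, 50, 52, 49, 51, 1000, None]

mean_imputed, mean_fill = impute(incomes, "mean")
median_imputed, median_fill = impute(incomes, "median")

print("mean fill:  ", mean_fill)    # dragged far above the typical values by the outlier
print("median fill:", median_fill)  # stays among the typical values
```

With this data the median fill is 50.5, while the mean fill is above 200 because of the single value of 1000. In Spark ML, the same choice is typically made through the `strategy` parameter of `pyspark.ml.feature.Imputer` (for example `strategy="median"`); check the documentation for the version you are running.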
Reference:
Data Imputation Techniques (Dealing with Outliers).
by Ula at May 22, 2025, 04:03 AM