Data Science Miscellaneous - Study Mode

[#56] Which statistical test is used to determine if there is a significant association between two categorical variables?
Correct Answer

(A) Chi-squared test

[#57] In the context of data ethics, what does "bias mitigation" refer to?
Correct Answer

(D) Reducing biases in data collection

[#58] What does the term "overfitting" mean in machine learning?
Correct Answer

(C) The model is too simple and underperforms on the training data

[#59] In the CRISP-DM data mining process model, what does "DM" stand for?
Correct Answer

(B) Data Mining

[#60] Which Python library is commonly used for natural language processing tasks such as text tokenization and language understanding?
Correct Answer

(C) spaCy