Master CompTIA DataX Certification Exam DY0-001: Your Gateway to Data Excellence
Which of the following distance metrics for KNN is best described as a straight line?
Correct : B
Euclidean distance measures the straight-line distance between two points in space, matching the geometric ''as-the-crow-flies'' notion of distance.
Start a Discussions
Given these business requirements:
Which of the following is the most likely optimization technique a data scientist would apply?
Correct : A
You must optimize boat trips subject to strict resource limits (fuel, boat capacity, travel distance), making this a constrained optimization problem (e.g., solvable via linear programming).
Start a Discussions
SIMULATION
A data scientist needs to determine whether product sales are impacted by other contributing factors. The client has provided the data scientist with sales and other variables in the data set.
The data scientist decides to test potential models that include other information.
INSTRUCTIONS
Part 1
Use the information provided in the table to select the appropriate regression model.
Part 2
Review the summary output and variable table to determine which variable is statistically significant.
If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.
Correct : A
Part 1
Linear regression.
Of the four models, linear regression has the highest R (0.8), indicating it explains the greatest proportion of variance in sales.
Part 2
Var 4 -- Net operations cost.
Net operations cost has a p-value of essentially 0 (far below 0.05), indicating it is the only additional predictor statistically significant in explaining sales. Neither inventory cost (p0.90) nor initial investment (p0.23) reach significance.
Start a Discussions
A data scientist is building a proof of concept for a commercialized machine-learning model. Which of the following is the best starting point?
Correct : A
Before diving into selecting or tuning models, a literature review grounds the proof of concept in existing research and best practices, ensuring the approach aligns with state-of-the-art methods and the problem's domain requirements.
Start a Discussions
Which of the following problem-solving approaches is a set of guidelines to handle highly variable and not fully apparent situations?
Correct : C
Heuristics are rule-of-thumb strategies that guide problem solving in complex, uncertain situations where a fixed algorithm or plan isn't feasible.
Start a Discussions
Total 85 questions