Which statement best describes the role of ethics in data science?
Investors should scrutinize the environmental practices of companies that hire data scientists.
It's best to analyze as much data as possible, so you learn what issues you need to fix.
Be sure that data inputs are accurate, because the output is often used to make policy decisions that affect people's health and well-being.
What are the four steps in the data science lifecycle?
Business understanding, data gathering and preparation, model training and testing, model deployment
Data gathering, data validation, machine learning, result visualization
Data identification, model training until 100% accurate, model deployment, business understanding
What is overfitting in machine learning?
When your machine learning model takes up most of the disk space on your server.
When your machine learning model is so broad, it misidentifies a new item as something it has been trained on.
When your machine learning model easily handles new types of items; it's a good thing.
What is the goal of manipulating data in data science?
To eliminate data that will make it harder to prove what you think must be true.
To remove incomplete or inconsequential data, so it doesn't skew the output away from the truth.
To clean up your results, so they are summarized and easy to present.
What is the role of subject matter experts (SMEs) in the data science lifecycle?
SMEs help set the scope of data analysis by identifying factors that will affect the outcome.
SMEs add the seal of approval on the results that your data points to.
SMEs help you interpret your data in the context of their specialties.