Which Algorithm Family Can Answer My Question?
This post, authored by Brandon Rohrer, Senior Data Scientist at Microsoft, is the last in a three-part series introducing data science with no jargon. The first post was titled What Can Data Science Do For Me? and the second was on What Types of Questions Can Data Science Answer?
There are a few data science questions that seem to pop up a lot. They’re listed here, together with the best algorithm family. If you don’t see yours or one like it, let us know and we’ll add it. Several of these questions have links to sample experiments or working examples in the Azure ML Marketplace.
● first choice ○ may also work well |
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Predictive Maintenance |
||||||
Should I replace this part now? |
● |
|
○ |
|
|
|
When will this part fail? |
|
○ |
● |
|
|
|
● |
|
○ |
|
|
|
|
Is this unit behaving in an unusual way? |
○ |
|
|
● |
|
|
|
|
● |
|
|
|
|
Which vehicle needs servicing most urgently? |
|
● |
○ |
|
|
○ |
Is this pressure reading unusual? |
○ |
|
|
● |
|
|
Are these voltages normal for this season and time of day? |
○ |
|
|
● |
|
|
Is this internet message typical? |
○ |
|
|
● |
|
|
● |
|
|
|
|
|
|
Out of a thousand units, how many of this model of bearings will survive 10,000 hours of use? |
|
○ |
● |
|
|
|
How likely is this employee to be an insider security threat? |
|
○ |
● |
|
|
|
Which printer models fail the same way? |
|
|
○ |
|
● |
|
Which groups of sensors in this jet engine tend to vary with (and against) each other? |
|
|
|
|
● |
|
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Marketing |
||||||
● |
|
○ |
|
|
|
|
When will this customer make another purchase? |
|
○ |
● |
|
|
|
Will this customer renew their subscription? |
● |
○ |
|
|
|
|
Of all my customers, which 10% should receive an offer? |
○ |
|
● |
|
|
|
● |
|
○ |
|
|
|
|
● |
|
|
|
|
|
|
Which offer should this customer receive? |
|
● |
|
|
|
○ |
● |
|
|
|
|
|
|
● |
|
|
|
|
|
|
Does the $5 coupon or the 25% off coupon result in more return customers? |
● |
|
○ |
|
|
|
● |
|
○ |
|
|
|
|
It this review positive or negative? |
○ |
|
● |
|
|
|
Is the overall Twitter mood on my business positive or negative? |
● |
|
○ |
|
|
|
How many new followers will I get next week? |
|
|
● |
|
|
|
Which advertisement should be listed first for this reader? |
|
|
○ |
|
|
● |
Will this customer click on the top link? |
● |
|
○ |
|
|
|
Is this combination of purchases very different from what this customer has made in the past? |
○ |
|
|
● |
|
|
|
|
● |
|
|
|
|
What fraction of pulls on this slot machine result in payout? |
|
○ |
● |
|
|
|
What other products is this customer likely to buy? |
|
● |
○ |
|
|
|
Which other customers have similar preferences to this one? |
|
|
|
|
● |
|
What is a natural way to divide this set of customers into groups? |
|
|
|
|
● |
|
|
● |
○ |
|
|
|
|
Which viewers like the same kind of movies? |
|
|
○ |
|
● |
|
Where should I place this ad on the webpage so that the viewer is most likely to click it? |
|
○ |
|
|
|
● |
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Finance |
||||||
|
○ |
● |
|
|
|
|
|
|
● |
|
|
|
|
How many shares of this stock should I buy right now? |
|
|
○ |
|
|
● |
Will mortgage interest rates go up, down, or remain the same next week? |
|
● |
○ |
|
|
|
How many orders will there be from a region for this product next month? |
|
○ |
● |
|
|
|
○ |
|
|
● |
|
|
|
● |
|
|
|
|
|
|
|
|
|
● |
|
|
|
How likely is this customer to repay a car loan? |
|
○ |
● |
|
|
|
● |
|
|
|
|
|
|
What are the most common patterns in gasoline price changes? |
|
|
|
|
● |
|
What is a natural way to break this set of companies up into groups? |
|
|
|
|
● |
|
How much are the stock prices in my portfolio likely to change in the next year? |
|
|
● |
|
|
|
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Operational Efficiency |
||||||
What will the demand for this item (or service) be next month? |
|
○ |
● |
|
|
|
|
|
● |
|
|
|
|
|
|
● |
|
|
|
|
What price should I set on this item? |
|
|
● |
|
|
○ |
● |
|
|
|
|
|
|
What fraction of today’s flights will depart on time? |
|
○ |
● |
|
|
|
How many employees should be scheduled to work on Black Friday? |
|
○ |
● |
|
|
|
Is it time to order more of this product? |
● |
|
|
|
|
○ |
What management practices do successful CEOs have in common? |
|
|
|
|
● |
|
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Energy Forecasting |
||||||
How many kilowatts will be demanded from my wind farm 30 minutes from now? |
|
○ |
● |
|
|
|
During which days of the week does this electrical substation have similar electrical power demands? |
|
○ |
|
|
● |
|
Is this grid likely to face an overload situation in the next day? |
● |
|
|
○ |
|
|
What will the consumer demand be in this region over the next month? |
|
|
● |
|
|
|
Is the power usage in this grid unusual? |
○ |
|
|
● |
|
|
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Internet of Things |
||||||
Has this patient’s health suddenly taken a turn for the worse? |
○ |
|
|
● |
|
|
Is your heart rate within your typical training range? |
○ |
|
|
● |
|
|
What activity is the wearer of a fitness tracker engaged in? |
|
● |
|
|
|
|
Should the robot vacuum clean the living room or continue to charge? |
○ |
|
|
|
|
● |
Do I move this obstacle or navigate around it? |
○ |
|
|
|
|
● |
Which aircraft is causing this radar signature? |
|
● |
|
|
|
|
Who is the speaker in this recording? |
|
● |
|
|
|
|
What will the temperature be next Tuesday? |
|
|
● |
|
|
|
● |
|
|
|
|
|
|
Should the thermostat adjust the temperature higher, lower, or leave it where it is? |
|
○ |
|
|
|
● |
Should I continue driving at the same speed, brake, or accelerate in response to that yellow light? |
|
○ |
|
|
|
● |
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Text and Speech Processing |
||||||
What is a natural way to break these documents into five topic groups? |
|
|
|
|
● |
|
|
● |
|
|
|
|
|
|
● |
|
|
|
|
|
What groups of words tend to occur together in this set of documents? (What are the topics they cover?) |
|
|
|
|
● |
|
|
● |
|
|
|
|
|
|
● |
|
|
|
|
|
|
● |
|
|
|
|
|
● |
● |
○ |
|
|
|
|
What is the translation of this sentence from English into Chinese? |
● |
● |
|
|
|
● |
Is the person speaking into the phone authorized to use the phone? |
● |
|
|
● |
|
|
● first choice ○ may also work well
|
Two-class classification |
Multi-class classification |
Regression |
Anomaly detection |
Unsupervised learning |
Reinforcement learning |
Image Processing and Computer Vision |
||||||
|
|
|
|
● |
|
|
● |
|
|
|
|
|
|
Is there a dog or a bench in this image? |
● |
|
○ |
|
|
|
What objects are in this image? |
|
● |
|
|
|
|
How many people are there in this photo? |
|
|
● |
|
|
|
● |
|
● |
|
|
|
|
What is this person doing in this video? |
|
● |
|
|
|
|
Does anyone behave suspiciously in this surveillance video? |
|
|
|
● |
|
|
In which direction should this robot move given what it sees? |
|
|
|
|
|
● |
After you choose the algorithm family that fits your question, the next step is to choose your algorithm and get to work. From here, it gets a bit more technical, but the final results are worth it. Visit How to Choose a Machine Learning Algorithm in Azure ML and the Machine Learning Algorithm Cheat Sheet for Azure ML to take the next step.
Brandon
Follow me on Twitter or ping me on LinkedIn
Comments
- Anonymous
September 02, 2015
AMAZING VIEW ! I love it !