Using the SQL Server Data Mining Algorithms: Tips and Resources

Work in progress

This article is a list of frequently asked questions about each of the data mining algorithms provided by SQL Server Data Mining.
Resources are also provided, where available, to help you understand how these algorithms compare or align with other open source or commercially available methods, including Excel features and third-party plug-ins for Excel.

What algorithms are included with SQL Server?

The algorithms (and other features) that are available depend on which version you have. For complete details, see this page:

SQL Server 2000
   - Clustering
   - Decision trees

SQL Server 2005
Many new features were added, including these data mining algorithms:
   - Decision trees with regression
   - Association rules
   - Sequence clustering
   - Time series (ARTXP)
   - Neural network
   - Naive Bayes

An easy-to-use Excel client for data mining was also added.

SQL Server 2008 and SQL Server 2008 R2
   - New ARIMA model for time series
   - No other new algorithms, but cross-validation was added, as well as the ability to split data into training and testing sets.
   - Added the ability to filter models while using the same mining structure. This is

The Excel client was updated to support all new features.
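
To illustrate what the training/testing split and cross-validation features mentioned above do conceptually, here is a minimal Python sketch. It is not how SQL Server implements these features; the function names (holdout_split, k_fold_indices), the default test fraction, and the seed are hypothetical choices made only for this example.

```python
import random


def holdout_split(rows, test_fraction=0.3, seed=0):
    """Shuffle the rows and hold out a fraction of them for testing.

    test_fraction and seed are illustrative defaults, not SQL Server settings.
    Returns (training_rows, testing_rows).
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def k_fold_indices(n_rows, k):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation.

    Each row index appears in exactly one test fold.
    """
    if not 2 <= k <= n_rows:
        raise ValueError("k must be between 2 and the number of rows")
    indices = list(range(n_rows))
    # Deal the indices into k folds of nearly equal size.
    folds = [indices[i::k] for i in range(k)]
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


if __name__ == "__main__":
    train, test = holdout_split(range(10), test_fraction=0.3, seed=1)
    print(len(train), "training rows,", len(test), "testing rows")
    for train_idx, test_idx in k_fold_indices(10, 5):
        print("test fold:", test_idx)
```

In both cases the model is built on the training rows only and evaluated on rows it has not seen; cross-validation repeats this once per fold so that every row is used for testing exactly once.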

SQL Server 2012
No change in data mining features, but the client was updated to support 64-bit Excel.

Association Model Questions and Tips

What are some uses?
Tips and tricks
Issues

Clustering Model Questions and Tips

What are some uses?
Tips and tricks
Issues

Decision Tree Questions and Tips

What are some uses?
Tips and tricks
Issues

Logistic Regression Questions and Tips

What are some uses?
Tips and tricks
Issues

Naive Bayes Questions and Tips

What are some uses?
Tips and tricks
Issues

Neural Networks Questions and Tips

What are some uses?
Tips and tricks
Issues

Time Series Questions and Tips

What are some uses?
Tips and tricks
Issues

Other Cool Tools

This section provides links to other resources.

My customer wants to do RFM analysis. What options/processes are available with SQL Server Data Mining or with Excel?

answer
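
As background for this question, RFM analysis summarizes each customer by recency (how long since the last purchase), frequency (how many purchases), and monetary value (how much was spent). The Python sketch below shows only that basic calculation. It does not describe any SQL Server Data Mining or Excel feature, and the function name rfm_table and the (customer, date, amount) input layout are assumptions made for illustration.

```python
from collections import defaultdict
from datetime import date


def rfm_table(transactions, as_of):
    """Compute recency, frequency, and monetary value per customer.

    transactions: iterable of (customer_id, purchase_date, amount) tuples
                  (a hypothetical layout chosen for this example).
    as_of:        the date from which recency is measured.
    """
    last_purchase = {}
    frequency = defaultdict(int)
    monetary = defaultdict(float)
    for customer, purchased_on, amount in transactions:
        # Track the most recent purchase date for each customer.
        if customer not in last_purchase or purchased_on > last_purchase[customer]:
            last_purchase[customer] = purchased_on
        frequency[customer] += 1
        monetary[customer] += amount
    return {
        customer: {
            "recency_days": (as_of - last_purchase[customer]).days,
            "frequency": frequency[customer],
            "monetary": monetary[customer],
        }
        for customer in last_purchase
    }


if __name__ == "__main__":
    sample = [
        ("A", date(2012, 1, 5), 20.0),
        ("A", date(2012, 3, 1), 35.0),
        ("B", date(2012, 2, 10), 100.0),
    ]
    for customer, values in rfm_table(sample, as_of=date(2012, 3, 31)).items():
        print(customer, values)
```

The three resulting columns are the kind of per-customer attributes that could then be binned into scores or used as inputs to a segmentation model.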

I need to do Holt-Winters analysis. Do you support that?

The Time Series Foundation, developed by Microsoft researcher Alex Bocharov, is a free platform for time series analysis that includes several variations on the Holt-Winters method.
Download the sample: http://research.microsoft.com/en-us/downloads/d599b5d2-5a99-4ee5-8c04-c180eb9044d9/
Read the blog: http://blogs.msdn.com/b/alexeib/
View BI Labs interview: http://www.microsoft.com/en-us/bi/featureVideo.aspx?videoId=qpr4tLPiAo0
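
For readers unfamiliar with the method, the additive form of Holt-Winters smoothing can be sketched as follows. This is a generic textbook formulation written in Python, not code from the Time Series Foundation. The function name, the parameter names, and the simple initialization (level from the mean of the first season, trend from the difference between the first two season means) are assumptions made for illustration.

```python
def holt_winters_additive(series, season_length, alpha, beta, gamma, horizon):
    """Forecast `horizon` future points with additive Holt-Winters smoothing.

    alpha, beta, and gamma are the smoothing weights (each between 0 and 1)
    for the level, trend, and seasonal components respectively.
    """
    m = season_length
    if len(series) < 2 * m:
        raise ValueError("need at least two full seasons of data")

    # Simple initialization from the first two seasons (an assumption).
    first, second = series[:m], series[m:2 * m]
    level = sum(first) / m
    trend = (sum(second) - sum(first)) / (m * m)
    seasonal = [y - level for y in first]

    # Update level, trend, and the seasonal term for each later observation.
    for t in range(m, len(series)):
        y = series[t]
        s_old = seasonal[t % m]
        prev_level = level
        level = alpha * (y - s_old) + (1 - alpha) * (prev_level + trend)
        trend = beta * (level - prev_level) + (1 - beta) * trend
        seasonal[t % m] = gamma * (y - level) + (1 - gamma) * s_old

    # Project the last level and trend forward and add the matching seasonal term.
    n = len(series)
    return [
        level + h * trend + seasonal[(n - 1 + h) % m]
        for h in range(1, horizon + 1)
    ]


if __name__ == "__main__":
    history = [10, 20, 30, 20] * 3  # three seasons of four periods each
    print(holt_winters_additive(history, 4, alpha=0.5, beta=0.5, gamma=0.5, horizon=4))
```

Other variations on the method differ mainly in how the components are combined (for example, multiplicative seasonality) and in how the initial values and smoothing weights are chosen.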

Misc