Using the SQL Server Data Mining Algorithms: Tips and Resources
|
****
This article is a list of frequently asked questions about each of the data mining algorithms provided by SQL Server Data Mining.
Resources are also provided, where available, to help you understand how these algorithms compare or align with other open source or commercially available methods, including Excel features and third-party plug-ins for Excel.
What algorithms are included with SQL Server?
The algorithms (and other features) that are available depend on which version you have. For complete details, see this page:
SQL Server 2000
- Clustering
- Decision trees
SQL Server 2005
A huge number of new features were provided including these new data mining algorithms:
- Decision trees with regression
- Association rules
- Sequence clustering
- Time series (ARTXP)
- Neural network
- Naive Bayes
An easy Excel client for data mining was also added
SQL Server 2008 and SQL Server 2008 R2
- New ARIMA model for time series
- No other algorithms, but cross-validcation was added, as well as the ability to split data into a traingi and testing set.
- Added the ability to filter models while using the same minign structure. This is
The Excel client was updated to support all new features.
SQL Server 2012
No change in data mining features, but the client was updated ot support 64-bit Excel.
Association Model Questions and Tips
What are some uses?
Tips and tricks
Issues
Clustering Model Questions and Tips
What are some uses?
Tips and tricks
Issues
Decision Tree Questions and Tips
What are some uses?
Tips and tricks
Issues
Logistic Regression Questions and Tips
What are some uses?
Tips and tricks
Issues
Naive Bayes Questions and Tips
What are some uses?
Tips and tricks
Issues
Neural Networks Questions and Tips
What are some uses?
Tips and tricks
Issues
Time Series Questions and Tips
What are some uses?
Tips and tricks
Issues
Other Cool Tools
This section provides links to other resources.
My customer wants to do RFM analysis. What options/processes are available with SQL Server Data Mining or with Excel?
answer
I need to do Holt_Winter analysis. Do you support that?
The Time Series Foundation, developed by Microsoft researcher Alex Bocharov, is a free platform for time series analysis that includes several variations on the Holt-Winters method.
Download the sample: http://research.microsoft.com/en-us/downloads/d599b5d2-5a99-4ee5-8c04-c180eb9044d9/
Read the blog: http://blogs.msdn.com/b/alexeib/
View BI Labs interview: http://www.microsoft.com/en-us/bi/featureVideo.aspx?videoId=qpr4tLPiAo0
Misc