Hadoop adventures with Microsoft HDInsight
What is HDInsight?
HDInsight is the product name for Microsoft’s installation of Hadoop and for its Hadoop service on Windows Azure. It is Microsoft’s 100% Apache-compatible Hadoop distribution, supported by Microsoft. Available both on Windows Server and as a Windows Azure service, HDInsight empowers organizations with new insights on previously untouched unstructured data, while connecting to the most widely used Business Intelligence (BI) tools on the planet.
It is available in two modes:
- HDInsight as Cloud Service: a cloud version running on Windows Azure
- HDInsight as Local Cluster: a downloadable version that runs locally on Windows Server and desktop Windows
In this article we will see how to use HDInsight on a local machine.
Where to get it?
- You can download the HDInsight Preview version from the link below:
- https://www.microsoft.com/web/handlers/webpi.ashx?command=GetInstallerRedirect&appid=HDINSIGHT-PREVIEW&mode=new
What does the Windows installer bring to your machine?
After the installation completes, you will see that the following applications are installed:
- Microsoft HDInsight Community Technology Preview Version 1.0.0.0
- Hortonworks Data Platform 1.0.1 Developer Preview Version 1.0.1
- If you do not change the installed components, Python 2.7.3150 is also installed
- The Java and C++ runtimes are also installed on the machine as required
Once the installer completes, the following shortcuts are set up on your machine:
- Hadoop Command Line
- Microsoft HDInsight Dashboard
- Hadoop MapReduce Status
- Hadoop Name Node Status
By default, Hadoop is installed at C:\Hadoop.
If you launch the “Hadoop Command Line” shortcut, you will see the list of commands, followed by this line:
Most commands print help when invoked w/o parameters.
Try checking the version as below:
c:\Hadoop\hadoop-1.1.0-SNAPSHOT>hadoop version
Hadoop 1.1.0-SNAPSHOT
Subversion on branch -r
Compiled by jenkins on Wed Oct 17 22:28:56 PDT 2012
From source with checksum 80f5614dfb0743b569344f051a07b37d
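If you would rather check the version from a script (the installer brings Python along anyway), here is a minimal sketch. It assumes that hadoop resolves on the PATH, as it does inside the Hadoop Command Line prompt, and that the version is reported on a line of the form "Hadoop <version>", as in the output above. The function names parse_hadoop_version and installed_hadoop_version are my own, not part of HDInsight.

```python
import re
import subprocess


def parse_hadoop_version(output):
    """Return the version string from `hadoop version` output, or None."""
    for line in output.splitlines():
        # Looks for a line such as "Hadoop 1.1.0-SNAPSHOT".
        match = re.match(r"^Hadoop\s+(\S+)\s*$", line.strip())
        if match:
            return match.group(1)
    return None


def installed_hadoop_version(command="hadoop"):
    """Run `hadoop version` and return the parsed version string.

    Assumption: `command` resolves on the PATH. shell=True is used so
    that Windows can pick up a .cmd wrapper if that is how it is installed.
    """
    output = subprocess.check_output(command + " version", shell=True)
    return parse_hadoop_version(output.decode("utf-8", "replace"))
```

Calling installed_hadoop_version() from a Python session started in the Hadoop Command Line prompt should return the same version string that the command prints on its first line.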
Now if you launch the “Microsoft HDInsight Dashboard” shortcut, you will see the dashboard running locally.
Launching the “Hadoop MapReduce Status” shortcut opens the MapReduce status page.
And launching the “Hadoop Name Node Status” shortcut opens the Name Node status page.
So, as you can see, you now have a Hadoop cluster running on your local machine.
Play with it a little more; my next article will have more info on this topic.
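If the status pages do not open, a quick way to see whether the services are listening at all is to probe their ports. The sketch below is only that, a sketch: the port numbers 50070 (Name Node) and 50030 (MapReduce) are the usual Hadoop 1.x web UI defaults and are my assumption here, so adjust them if your shortcuts point somewhere else. The names STATUS_PORTS, port_is_open and check_status_pages are hypothetical helpers, not HDInsight APIs.

```python
import socket

# Assumed Hadoop 1.x web UI defaults; check the shortcut targets
# on your machine and adjust if they differ.
STATUS_PORTS = {
    "Name Node status": 50070,
    "MapReduce status": 50030,
}


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        conn = socket.create_connection((host, port), timeout)
    except (socket.error, socket.timeout):
        return False
    conn.close()
    return True


def check_status_pages(host="localhost", ports=STATUS_PORTS):
    """Map each status page name to True/False depending on its port."""
    return {name: port_is_open(host, port) for name, port in ports.items()}
```

Running check_status_pages() returns a dictionary keyed by page name. A False value only tells you that nothing accepted a connection on that port; it does not tell you why.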
Have fun with Hadoop!!
Comments
Anonymous
February 06, 2013
Hi Avkash, Thanks for the useful information. HDInsight Preview version is not available under the web installer platform 4.0 using below link: www.microsoft.com/.../webpi.ashx please provide your inputs and how to install this preview
Anonymous
May 21, 2013
Wonderful information to start with
Anonymous
July 16, 2014
I am not seeing the HDinsight dashboard can you please help me