So, I built my first single node Hadoop cluster today using Ubuntu 14.04 LTS, Hadoop 2.7.1 and Oracle Java 1.7.0_80 on VMware Workstation. For my next trick, I’ll reproduce this on my vCenter server at home and put some power behind it. 2-cores and 2GB of RAM in VMware Workstation doesn’t really seem all that powerful.
Here is a tip, use the guide located here: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/SingleCluster.html
However, now that I have it up and running, I’m not quite sure what to do with it… I suppose that will be the next thing I explore in my quest to become the next Data Scientist,