Hello,
During this winter, I participated in
which I just stumbled upon while searching for something.
This challenge was about using IBM Bluemix’s “Analytics For Hadoop” service to process a data set of at least 500 MB.
This was a wonderful opportunity to get some hands-on experience with IBM Bluemix (IBM is giving extended trial access to participants). Apart from this, I was also keen to build a data visualization app of my own.
I selected one year of CitiBike data (2013-2014). Initially I did not have a clue about what insights I could gather from the dataset, but as soon as I ran some Apache Pig scripts and started looking at the output, I could see more and more use cases for it. I could not address all the use cases I thought of, as I soon ran into deadline pressure: I still had to finish the video demonstration and a write-up about the project.
Overall, it was a very enriching experience, as I did so many things for the very first time.
Some of them are listed below:
- IBM BigSheets and Big SQL
- Using Chart.js library
- Using Google Maps JavaScript APIs – It was remarkably simpler than I thought. I much appreciate these APIs from Google.
- Creating the custom map icon – I never realized it would be this difficult
- HTML5/CSS challenges when putting together the UI
- Last but not least, GitHub’s easy way to publish your work online.
Now that the challenge is in the public voting and judging phase, I would appreciate it if you could take a look at
http://ibmhadoop.challengepost.com/submissions/33509-citibike-looking-back-in-a-year
and provide your feedback, and vote for it if you like it.