Saturday, May 25, 2013

My upcoming workshop on Storm at The Fifth Elephant, 2013

The Fifth Elephant is a conference in Bangalore, India which focuses on Big Data and Analytics. Its a community powered conference. This means, as highlighted in their website, it is "Of the Community, By the Community, For the Community". So, any one can propose a session as such for the conference in the Funnel. Participants who have purchased conference tickets can vote on session proposals.

In 2012 edition, Fifth Elephant with more than 50 sessions, attracted 600+ participants from many MNCs, startups alike. This 2-day conference was preceded by a one-day workshop sessions as well due to the overwhelming demand. The biggest USP of any of the HasGeek organized conferences [other than being community powered conferences] is, they live stream most of the sessions and they also upload all the recorded videos to Youtube and / or HasGeek TV.

I gave a session on Introduction to Pig at The Fifth Elephant last year. I have written about it previously here.

The Fifth Elephant is back this year. Its even better with a dedicated day for Workshops at the same venue and the regular 2-day Conference on Big Data, Storage and Analytics and also with the product demos and hacker corners. There are some wonderful sessions proposed including one on Neo4J, Julia, etc and few sessions have already been selected by the Program Committee.

I have been working on Big Data especially on Hadoop Ecosystem since more than 2 years now. I am fascinated by Big Data and various tools / frameworks which help analyze such large amounts of data. During this time, I came across Storm, which not just analyzes the Big Data, but analyzes in real-time. Yes, real-time very unlike Hadoop, which is basically batch-processing. I worked on couple of use cases and processed the streaming live data in really real-time using Storm. I can quote streaming tweets as my main source of real-time data , which I processed for multiple use cases using Storm.

This year I have proposed a session on Storm titled "Big Data, Real-time Processing and Storm" and it has been accepted as the first workshop this time around. I will be speaking on 11th July, 2013. It will be a live-coding session, which will help the participants understand and appreciate Storm as one of the better alternatives of Hadoop. Below is the outline of this workshop.

I have also uploaded the slides of the outline of this workshop to SpeakerDeck.

Check the above slides and do let me know if you have any feedback and / or comments on this outline for the workshop.

Wish me good luck on @P7h. And also if you happen to be there in this Conference, do come and say hi.

Please find the complete slides of this workshop session here.

Update on 05th April, 2015: After a fair bit of time here, I have moved on to GitHub hosted Octopress blogs. Please find me on henceforth for all new updates.

Monday, May 20, 2013

Open Source licenses

Understanding Open Source licenses turns out to be rather too difficult. At least I always have had issues in understanding which Open Source license is too restrictive and which is a bit liberal.

After looking around for some time, I found the following 3 alternatives for easily understanding the terms of few of the Open Source licenses.


+Brian Fitzpatrick and +Dan Bentley have made a brilliant flow chart for Open Source licenses. It is pretty simple and easy to understand.


+Marakana have another interesting flowchart for understanding Open Source licenses.


And finally another option to understand Open Source licenses is tl;drLegal Website, which summarizes and explains Open Source licenses in simple terms and in plain English. Its a pretty decent website which is fast and also very intuitive and easy to use. Just key in the name of the License you want to read about. The website will do the rest with a quick summary and also the full text of this particular license.

I always look up to one of these alternatives when I am in doubt about licensing terms of a particular Open Source License. And when I need to check the complete text of a license, I usually use tl;drLegal Website.

Hope this is of some help to you too.

Update on 05th April, 2015: After a fair bit of time here, I have moved on to GitHub hosted Octopress blogs. Please find me on henceforth for all new updates.