Playing with Hadoop

By Søren Lund (‎slu‎) from Copenhagen.pm
Date: Saturday, 23 November 2013 15:30
Duration: 45 minutes
Target audience: Any
Language: English
Tags: bigdata hadoop java perl ubuntu

You can find more information on the speaker's site:


Recently, I was asked to help with a demonstration of Hadoop. The presentation was part of a larger presentation about Big Data.

Thus, I've been spending some time on

* looking into what Hadoop is
* pondering what Hadoop can be used for
* installing Hadoop, and
* playing with Hadoop

This talk will present the above subjects divided into four parts.

In the first part, I will talk about the background and history of Hadoop. I will also introduce the concept of BigData.

In the second part, I will describe how to install Hadoop on a single host in what is called pseudo distributed mode. This mode suitable for local development of Hadoop applications, and similar to how you would install Hadoop in a production environment. I will also demonstrate how to run a Hadoop job on some example data.

In the third part, I will talk about MapReduce. MapReduce is how you write you code for Hadoop. The built-in way is using Java, but I will also show how to use Perl to write MapRecude scripts that interacts with Hadoop.

Finally, I will talk about what Hadoop could be used for, and how to deploy your Hadoop jobs in the Cloud.


Attended by: Salve J. Nilsen (‎sjn‎), Anatoliy Dmytriyev (‎tolid‎),

Sponsors & Partners

Thanks to our sponsors and partners for making the workshop possible:

The Nordic Perl Workshop has a long tradition of being hosted between the Nordic Countries and cities, by local monger groups in happy collaboration. A list of previous workshops is available at: http://perlworkshop.dk/.


The Nordic Perl Workshop 2013 is hosted by DK Hostmaster A/S


Dinner is sponsored by Perl6.org - The Perl 6 Developers Community


Perl Weekly is the best source for up-to-date news from the Perl community.