Getting started with Hadoop | Coderholic: "Hadoop is an open source Java implementation of Google’s MapReduce, a distributed programming technique. I’ve been investigating Hadoop lately for some data processing tasks at work. It is a bit of a minefield at first, so I’m writing this post partly as a way for me to keep track of everything, and also to hopefully save somebody else some time, or maybe to spark an an interest in Hadoop."