Defining Hadoop: the Players, Technologies and Challenges of 2011

Summary:

Hadoop has been used by large web companies for applications such as search engines, but the reality is that the project is so much more. This report takes a closer look, examining what Hadoop is (and isn’t), who’s doing what to productize it and why we can expect to see the market pick up serious steam in 2011. We profile the growing number of companies — from startups like MapR to Cloudera, the arguable leader in the space — using Hadoop, the challenges still hindering widespread adoption and where potential users can expect the market to go as we move through 2011 and beyond. Companies mentioned in this report include Yahoo, Facebook, EMC, Teradata and Appistry. For a full list of companies, and to read the full report, sign up for a free trial.

  1. Table of Contents
  2. About Derrick Harris
  3. About GigaOM Pro
  4. Executive Summary
  5. Introduction – Apache Hadoop
    1. What Hadoop Is (and Is Not)
  6. The Hadoop Ecosystem
    1. The Distributions
      1. Apache Hadoop
      2. Cloudera’s Distribution for Apache Hadoop (CDH)
      3. IBM Distribution of Apache Hadoop
      4. DataStax Brisk
      5. Related Projects
    2. Other Hadoop-Based Products
    3. ISVs Supporting Hadoop
  7. Hadoop Use Cases
    1. High-level Use Cases
    2. Who’s Using Hadoop
    3. Specific Use Cases
    4. State of Deployment (Research or Production)
    5. Deployment Size
  8. Challenges
    1. Outlook
    2. Growth
    3. New Technologies
  9. New Opportunities
  10. Further Reading