I have recently had several requests from people asking for resources to learn about Big Data and Hadoop. Below is a list of resources that I typically recommend. I'll update this list as I find more resources. Let's crowdsource this... Tell me your favorite resources and I'll get them on the list!
Books and Whitepapers
Planning for Big Data (free e-book)
Great primer on the general Big Data space. This is always my recommendation for people who are new to Big Data and are trying to understand it.
Hadoop: The Definitive Guide by Tom White
This dives deep under the hood of Hadoop. It should not be a first book for someone who is just starting with Hadoop, MapReduce, or Big Data. Make sure you don’t get the first edition. The third edition is the best, as it also dedicates a chapter to HBase, Hive, and other tools in the ecosystem that are important to understand.
Programming Pig by Alan Gates
Great (and entertaining) book about Pig. The first chapter is a really good primer on Hadoop.
Programming Hive by Edward Capriolo, Dean Wampler, and Jason Rutherglen (est. publication date 10/9/2012)
Nothing to say about this book yet, as it hasn’t been released. I will add a quick blurb when I have a chance to read it.
“If You Have Too Much Data, then ‘Good Enough’ Is Good Enough” by Pat Helland
Great whitepaper that discusses the tenets behind distributed systems.
Websites
Apache Hadoop: http://hadoop.apache.org/
Microsoft Big Data Solution: www.microsoft.com/bigdata
Windows Azure: www.windowsazure.com/en-us/home/scenarios/big-data
Webcasts
Hadoop Videos on Microsoft TechNet: http://social.technet.microsoft.com/wiki/contents/articles/6204.hadoop-based-services-for-windows-en-us.aspx#videos
Hortonworks Video Series: http://hortonworks.com/videos/
Cloudera Video Series: http://www.cloudera.com/resource-types/video/
Tim O'Reilly and Dave Campbell Explore How to Accelerate Insights from Data
Denny Lee talks about Big Data
Blogs
Andrew Brust on ZDNet: http://www.zdnet.com/blog/big-data/
Denny Lee: http://dennyglee.com/
Carl Nolan: http://blogs.msdn.com/b/carlnol/archive/tags/hadoop+streaming/
Cindy Gross: http://blogs.msdn.com/b/cindygross/
Oakleaf Blogs (good for Hadoop on Azure): http://oakleafblog.blogspot.com/
Buck Woody, "Big Data: A Microsoft Tools Approach": http://sqlblog.com/blogs/buck_woody/archive/2012/02/20/big-data-a-microsoft-tools-approach.aspx
Forrester Blogs: http://blogs.forrester.com/category/big_data
Try Now
Preview of the Hadoop-based service for Windows Azure: https://www.hadooponazure.com