Thursday, October 2, 2014

Book Review: Using Flume: Stream Data into HDFS and HBase


Book Review:  Using Flume: Stream Data into HDFS and HBase by Hari Shreedharan: Publisher- O'Reilly: ISBN- 13: 978-1449368302



Using Flume: Stream Data into HDFS and HBase is for developers as well as Administrators of Hadoop clusters.  In its first chapter book discusses HBase which is little puzzling but as book progresses, it takes you for  deep dive in various aspects of Flume.  Book covers Streaming of data, various sources, channels, sinks, interceptors, and other components of Flume.

The last chapter is about administration of Flume which is very short. This chapter might be little bit in depth to cover capacity planning, deployment options, etc.

Nevertheless, book is a good reference for any person playing in Hadoop playground.


Disclaimer: I did not get paid to review this book, and I do not stand to gain anything if you buy the book. I have no relationship with the publisher or the author. I got electronic format of book from publisher for review.

Further reading: Apache Flume: Distributed Log Collection for Hadoop (http://www.amazon.com/Apache-Flume-Distributed-Collection-Hadoop/dp/1782167919)


One can get more information about book and related topics from:

  1. Amazon: http://www.amazon.com/Using-Flume-Stream-Data-HBase/dp/1449368301
  2. Publisher -- Oreilly http://shop.oreilly.com/product/0636920030348.do

No comments:

Post a Comment