Agile, Scrum, Kanban, Architecture, ...: Bigdata Ecosystem

Tuesday, December 2, 2014

Bigdata Ecosystem

I have had a lot of discussions on big data with my clients and prospects. During these discussions some questions comes up on Hadoop for ex: – What are the different components of a Hadoop ecosystem?

In my point of view the question should be - What are different components of a Bigdata ecosystem?

Everyone seems to have a different answer for it. I have tried to consolidate the answers. The results are this picture.

2 comments:

Back To BasicsJanuary 17, 2015 at 12:23 PM
Hi,
I love the way you mentioned the different components in that image. Under Data Access and Processing why did you separate out MapReduce, Giraph, Mahout in one container, Spark & Storm into another and HBase, Cassandra & Impala under other? Are they all not the same genre to access HDFS?
ReplyDelete
Replies
tjainJanuary 28, 2015 at 11:40 AM
MapReduce, Giraph, Mahout are for batch processing ( mahout may not fit in to this category) while Spark & Storm for streaming. HBase, Cassandra & Impala are NO SQL databases, so shown in different box. This separation is arbitrary and can be done in multiple ways.
ReplyDelete
Replies

Add comment

Disclaimer & Copyright

The entries in my blog are solely my opinions and do not represent the thoughts, intentions, plans or strategies of any third party, including my employer, except where explicitly stated. Needless to say, a weblog is a snapshot in time. Over time, as I interact with the community at large and/or learn more about various topics, my thoughts and opinions are subject to change. As such you should not consider out of date posts to reflect my current thoughts and opinions. Java, Oracle, Orcle Fusion Middleware, TIBCO, Sun, Microsoft, IBM, WebSphere, SAP, NetWeaver, Cloudera, HortonWorks and any other mentioned are trade marks of respective owners. © Copyright 2001-2015, Tushar Jain

Agile, Scrum, Kanban, Architecture, ...

Tuesday, December 2, 2014

Bigdata Ecosystem

2 comments:

Followers

Add to Technorati Favorites

My Docs

Blog Archive

Contributors

My Blog List

Disclaimer & Copyright

Agile, Scrum, Kanban, Architecture, ...

Tuesday, December 2, 2014

Bigdata Ecosystem

2 comments:

Subscribe To SOA Blog

Followers

Add to Technorati Favorites

My Docs

Blog Archive

Contributors

My Blog List

Disclaimer & Copyright