HBase is a remarkable tool for indexing massive volumes of data, but getting started with this distributed database and its ecosystem can be daunting. With this hands-on guide, you'll learn how to architect, design, and deploy your own HBase applications by examining real-world solutions. Along with HBase principles and cluster deployment guidelines, this book includes in-depth case studies that demonstrate how large companies solved specific use cases with HBase.
Authors Jean-Marc Spaggiari and Kevin O'Dell also provide draft solutions and code examples to help you implement your own versions of those use cases, from master data management (MDM) and document storage to near real-time event processing. You'll also learn troubleshooting techniques to help you avoid common deployment mistakes.
- Learn exactly what HBase does, what its ecosystem includes, and how to set up your environment
- Explore how real-world HBase instances were deployed and put into production
- Examine documented use cases for tracking healthcare claims, digital advertising, data management, and product quality
- Understand how HBase works with tools and techniques such as Spark, Kafka, MapReduce, and the Java API
- Learn how to identify the causes and understand the consequences of the most common HBase issues
Authors: Jean-Marc Spaggiari, Kevin O'Dell
Publisher: O'Reilly Media
Binding Type: Paperback
Size: 9.10h x 7.00w x 0.50d
About the Authors
Jean-Marc Spaggiari, an HBase contributor since 2012, works as a Solutions Architect specializing in HBase at Cloudera, where he supports Hadoop and HBase users through technical support and consulting work. He has worked with some of the biggest HBase users in North America.
Jean-Marc's primary role is to support HBase users with their cluster deployments, upgrades, configuration, and optimization, as well as with HBase-related application development. He is also a very active HBase community member, testing every release for performance and stability.
Prior to Cloudera, Jean-Marc worked as a Project Manager and as a Solution Architect for CGI and insurance companies. He has almost 20 years of Java development experience. In addition to regularly attending HBaseCon, he has spoken at various Hadoop User Group meetings and many conferences in North America, usually focusing on HBase-related presentations and demonstrations.
Kevin O'Dell is currently a Field Engineer at Rocana, where he works with customers to architect large-scale IT Operations. Prior to Rocana, Kevin worked at Cloudera for over four years, where he interacted with numerous Fortune 500 companies across every vertical.
In addition to his day-to-day work at Rocana, Kevin works closely with the open source Apache community. He is a contributor to the Apache HBase project, has written numerous blog posts, and has presented at multiple conferences on the Hadoop ecosystem.