Here is another interesting use case that came up while I was working with one of our clients in the insurance industry. The client had an enormous amount of claims data residing in multiple SQL Server databases that needed to be consolidated into one. Some of the queries on this data took days to run, so we were looking for an alternative solution that could process the data in a distributed fashion and save time. Since the company was already using Hadoop, we started looking into a Hadoop-based solution.
We had a few options on the table, such as Hive, Pig, and HBase, and after some brainstorming decided to go with HBase for the following reasons:
- It is an open source distributed database that yields high performance while remaining cost effective.
- We do not have to worry about distributing the data for faster processing, since Hadoop (HDFS) takes care of it.
- Support for batch processing without the need to maintain secondary indexes; data is accessed directly by row key.
- Data integrity, as HBase confirms a write only after its write-ahead log entry has been acknowledged by all three HDFS replicas.
- Easily scalable, fault tolerant, and highly available.
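To give a feel for the row-key access model mentioned above, here is a minimal HBase shell session. The table name, column family, and row keys are hypothetical examples, not the client's actual schema:

```
hbase> create 'claims', 'cf'                       # table with one column family
hbase> put 'claims', 'claim-00001', 'cf:amount', '1250.00'
hbase> get 'claims', 'claim-00001'                 # direct lookup by row key, no index needed
hbase> scan 'claims', {STARTROW => 'claim-00001', STOPROW => 'claim-00010'}
```

Lookups and range scans are driven entirely by the row key, which is why choosing a good row-key design matters far more in HBase than defining indexes does in a relational database.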
Now the next step was to move the data from SQL Server into HDFS, for which we used Sqoop.
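A Sqoop import along these lines gets the job done. This is a sketch, not the exact command we ran: the connection string, database, table, and column names below are placeholders, and the real import ran once per source database:

```
# Import a claims table from SQL Server into HDFS.
# Host, database, credentials, and table/column names are placeholders.
sqoop import \
  --connect "jdbc:sqlserver://dbhost:1433;databaseName=claims_db" \
  --driver com.microsoft.sqlserver.jdbc.SQLServerDriver \
  --username etl_user -P \
  --table claims \
  --split-by claim_id \
  --num-mappers 8 \
  --target-dir /data/claims
```

The `--split-by` column is what lets Sqoop partition the source table across parallel mappers, so it should be a roughly uniformly distributed key; `-P` prompts for the password instead of putting it on the command line.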