Faster, more capable: What Apache Spark brings to Hadoop

Hadoop | Tech and Tools |

Published February 11, 2014 |

Nick Heath

Apache Spark is an execution engine that broadens the type of computing workloads Hadoop can handle, while also tuning the performance of the big data framework.

Hadoop specialist Cloudera recently announced that it will offer commercial support for Apache Spark, which is available as part of Cloudera’s Hadoop-powered Enterprise Data Hub. But why should businesses care about Spark?

Apache Spark has numerous advantages over Hadoop’s MapReduce execution engine, in both the speed with which it carries out batch processing jobs and the wider range of computing workloads it can handle.

Spark is able to execute batch-processing jobs between 10 to 100 times faster than the MapReduce engine according to Cloudera, primarily by reducing the number of writes and reads to disc.

“You have map and reduce tasks and after that there’s a synchronisation barrier and you persist all of the data to disc,” said Mark Grover, Hadoop engineer for Cloudera.

Recent Blogs

Why vector databases are key to enhanced AI and data analysis

Why vector databases are key to enhanced AI and data analysis

Artificial Intelligence, Industry, others

In a...

Large Language Models (LLMs) leveraged for data enrichment: Transform data into insights

Large Language Models (LLMs) leveraged for data enrichment: Transform data into insights

Artificial Intelligence

Data...

Natural Language Querying (NLQ): The future of search

Natural Language Querying (NLQ): The future of search

Artificial Intelligence, Business Intelligence, Industry

Everything...

AI model stores: time saver or just another layer?

AI model stores: time saver or just another layer?

Artificial Intelligence

You walk...

Accelerating Revenue with Recommendation-as-a-Service (RaaS): The Power of Personalized Experiences

Accelerating Revenue with Recommendation-as-a-Service (RaaS): The Power of Personalized Experiences

Artificial Intelligence, Industry

As much...

The synthetic data revolution

The synthetic data revolution

Artificial Intelligence, Industry

British...

An entrepreneur’s guide to managing a consumer-centric digital product

An entrepreneur’s guide to managing a consumer-centric digital product

All the...

Why Invest In Data?

Why Invest In Data?

Large...

How to Exploit the Power of Data with the Right Unstructured Data Management Strategy

How to Exploit the Power of Data with the Right Unstructured Data Management Strategy

In...

5 Crucial Steps to Investing in AI for Your Business

5 Crucial Steps to Investing in AI for Your Business

Artificial Intelligence, Tech and Tools

It’s no...

How big data and product analytics are impacting the fintech industry

How big data and product analytics are impacting the fintech industry

The...

Can trading bots make you a Crypto billionaire?

Can trading bots make you a Crypto billionaire?

The...

How Even the Most World-Weary Investors are Leveraging the Power of Big Data to Make Trades

How Even the Most World-Weary Investors are Leveraging the Power of Big Data to Make Trades

It's no...

Low-code platforms for building enterprise applications – which make for the best investment?

Low-code platforms for building enterprise applications – which make for the best investment?

The...

What you need to build and implement an enterprise big data strategy

What you need to build and implement an enterprise big data strategy

Enterprise...

Big data challenges and how to overcome them

Big data challenges and how to overcome them

For...

Can decentralization help us save free speech on the internet?

Safety...

Big Data and blockchain are a perfect match. So what's keeping them apart?

Big Data and blockchain are a perfect match. So what's keeping them apart?

Not that...

4 applications of big data in Supply Chain Management

As Big...

How to help high schoolers understand big data

How to help high schoolers understand big data

Data Science, Tech and Tools

Data...

The use of big data in manufacturing industry

The use of big data in manufacturing industry

Data Science, Tech and Tools

Approximat...

5 must have tools for every stock trader

5 must have tools for every stock trader

For the...

How to leverage stock screeners to find compelling stock opportunities

How to leverage stock screeners to find compelling stock opportunities

he stock...

The importance of big data and open source for the blockchain

The importance of big data and open source for the blockchain

Digital...

How modern day AI-based products are empowering businesses?

How modern day AI-based products are empowering businesses?

Artificial Intelligence

Artificial...

Challenges of maintaining a traditional data warehouse

Challenges of maintaining a traditional data warehouse

One of...

Subscribe to the Crayon Blog

Get the latest posts in your inbox!