Dark

Light

Dark

Light

Hadoop

Kumar Chinnakali
Posted by Kumar Chinnakali
January 11, 2016

Top ten pointers in the new Apache Spark release (version 1.6)

In 2016, we should be excited that Apache Spark community launched Apache Spark 1.6. Committers – There are around 1000 contributors to Apache Spark,...

Read More
Kumar Chinnakali
Posted by Kumar Chinnakali
January 5, 2016

What is the role of RDDs in Apache Spark? – Part 1

This blog introduces Spark’s core abstraction for working with data, the RDD (Resilient Distributed Dataset). An RDD is simply a distributed collection of elements...

Read More
Kumar Chinnakali
Posted by Kumar Chinnakali
December 24, 2015

Is Apache Hadoop the only option to implement big data?

Yes, Hadoop is not only the options to big data problem. Hadoop is one of the solutions. The HPCC (High-Performance Computing Cluster) Systems technology...

Read More
Kumar Chinnakali
Posted by Kumar Chinnakali
November 23, 2015

The top 12 Apache Hadoop challenges

Hadoop is a large-scale distributed batch processing infrastructure. While it can be used on a single machine, its true power lies in its ability...

Read More
Kumar Chinnakali
Posted by Kumar Chinnakali
November 4, 2015

What are the 3 S's of Spark and its effect on big data?

Many thanks for your cherished time, this time we like to share with you the details on what is 3 S’s of Spark as we...

Read More
Kumar Chinnakali
Posted by Kumar Chinnakali
September 14, 2015

(Big) Data in Data Lake vs. Data Warehouse - Interesting things to consider

Big data is used across verticals like Insurance, Healthcare, Manufacturing, Financial, Retail and more. Companies are using big data to improve top & bottom...

Read More