Skip to main content

Mapreduce Vs Pig Vs Hive


Apache Hadoop is an open source framework intended to make interaction with big data easier. Hadoop has made its place in the industries and companies that need to work on large data sets which are sensitive and needs efficient handling. 

There are several components that Hadoop Ecosystem has to handle the huge data collectively. MapReduce, Pig and Hive are one of the Key components.



MapReduce is a software framework and programming model used for processing huge amounts of data. MapReduce program works in two phases, namely, Map and Reduce. Map tasks deal with splitting and mapping of data while Reduce tasks shuffle and reduce the data.

Apache Pig is an abstraction over MapReduce. It is a tool/platform which is used to analyze larger sets of data representing them as data flows. Pig is generally used with Hadoop. We can perform all the data manipulation operations in Hadoop using Pig.

Hive is an open-source system that processes structured data in Hadoop, residing on top of the latter for summarizing Big Data, as well as facilitating analysis and queries. Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. The structure can be projected onto data already in storage.

There are differences among three components.

 

MapReduce

Pig

Hive

MapReduce built on top of Hadoop

Pig is open source

Hive open source

It is a data processing paradigm.

It is a data flow language.

Hive uses a language called HiveQL.

MapReduce is low level language.

Pig is a high level language.

HiveQL is a query processing language.

MapReduce jobs have a long compilation process.

In pig there is no need for compilation.

Hive compiler parses the query.

Exposure to Java is must to work with MapReduce.

Basic knowledge of SQL is enough to work with Apache Pig.

Basic knowledge of SQL is enough to work with Hive.

MapReduce was developed by Google

It was originally created at Yahoo.

It was originally created at Facebook.

More lines of code

Comparatively less line of codes than MapReduce

Comparatively less line of codes than MapReduce and Pig

More development involved

Development effort is less code efficiency

 

Development effort is less code efficiency

 

MapReduce can handle structured and unstructured data

Apache Pig can handle structured, unstructured, and semi-structured data.

 

Basically Hive handle only structured data.

 

 

Comments

Popular posts from this blog

Photo : Savitri during her last days

Even after 4 decades of her death, people are very much interested to watched her (Savitri) biopic. Mahanati turned to be stupendous success. On eve of remembering Savitri, here we posted Savitri photo. The photo seems her last days. The man stand behind Savitri is Gemini Ganeshan who married Savitri. It seems that she suffered bad days during her last days.

Suchi Leaks Next Target - Samantha

Wikileaks created huge political uproar among Indian Politics and Bureaucrats. Suchi leaks causes disturbances both in Tollywood and Kollywood. Female Singer Suchitra discloses personal issues of celebrities in Telugu and Tamil. She leaked few photographs of Heros and Heroines who spend their personal time in private parties. Karthik, husband of Suchitra responded on leaks that she was not mentally firm. Few actors in particular Tamanna, Trisha, Dhanush, Anirudh and Rana were feel uncomfortable on these leaks. Trisha reacted on Photo of Rana who kissed her said that it was just peck between two friends. There was nothing more beyond that. Suchitra said that she was raped by Dhanush and Anirudh after the party in the night. In her another leak, Tamanna was too raped by Dhanush. As per buzz, Suchi's next leak will be related to Heroine Samantha. Earlier, Samantha had love affair with Hero Siddhartha. Both were participated in Rahu-Kethu pooja in Srikalahasti. There were many go...

Girl signed on 100/- Stamp Paper for getting 5K!

YSRCP Govt gave notification for more than 2.6 lakh jobs for Village Volunteers and was selected through interview process. All unemployed youth from SSC to Graduate were applied for the posts.