#bigdata 27e - PIG and Hive Languages
Apache Hive (credits Apache Foundation)

PIG and Hive are scripting languages that translate high-level commands into jobs executed by MapReduce. This simplifies parallel programming on Hadoop, which natively uses the Java language.

With that, Hadoop development has been simplified, increasing programmers’ productivity and allowing non-Java professionals to become Hadoop users and programmers.
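To make the translation concrete, here is a minimal Python sketch of the map, shuffle and reduce phases that a one-line word count in a high-level language stands for. It is not Hadoop code: the function names and the sample data are illustrative assumptions, and a real job would run distributed across a cluster.

```python
from collections import defaultdict

# Illustrative only: simulates, in a single process, the phases of a MapReduce job.
# A high-level query such as
#   SELECT word, COUNT(*) FROM words GROUP BY word;
# expresses the same computation in one statement.


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.split():
            yield word.lower(), 1


def shuffle_phase(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines):
    """Chain the three phases into one word-count job."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["Pig and Hive", "Hive on Hadoop", "Pig on Hadoop"]  # hypothetical input
    print(word_count(sample))
```

Even in this toy form, the programmer has to write three functions and wire them together, whereas the high-level version is a single declarative statement.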

Pig Latin is the data flow language offered by PIG. It does not require data to have a schema (as SQL does with tables and relationships), which makes it suitable for processing unstructured data.
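The data flow idea can be sketched in Python as a chain of steps applied to raw lines that have no declared schema. The sample log lines, the file name in the comment and the helper names are hypothetical; the Pig Latin in the comment is only a rough equivalent, not a script taken from this article.

```python
import re

# Hypothetical raw input: lines with no common structure and no declared schema.
RAW_LINES = [
    "2024-01-01 INFO service started",
    "ERROR disk full on node7",
    "user=ana action=login",
    "2024-01-01 ERROR timeout while reading block",
]

# A rough Pig Latin equivalent of the steps below (file name is an assumption):
#   lines   = LOAD 'app.log' USING TextLoader() AS (line:chararray);
#   errors  = FILTER lines BY line MATCHES '.*ERROR.*';
#   grouped = GROUP errors ALL;
#   total   = FOREACH grouped GENERATE COUNT(errors);


def load(lines):
    """LOAD step: hand each raw line downstream unchanged."""
    yield from lines


def filter_errors(lines):
    """FILTER step: keep only lines that contain the token ERROR."""
    return (line for line in lines if re.search(r"ERROR", line))


def count(lines):
    """GROUP ALL + COUNT step: count whatever reaches the end of the flow."""
    return sum(1 for _ in lines)


def count_errors(lines):
    """Run the whole flow: load, filter, count."""
    return count(filter_errors(load(lines)))


if __name__ == "__main__":
    print(count_errors(RAW_LINES))
```

Each step consumes the output of the previous one, which is the sense in which the language describes a flow of data and not a query over predefined tables.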

Hive turns Hadoop into a Data Warehouse, using a SQL dialect called HiveQL for data queries. Its scripts are very similar to SQL, which simplifies the work for professionals who already know SQL database programming.

Hive can also be run interactively, in the same way that SQL queries are run interactively against a database.
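As a hedged illustration of how familiar such queries look, the sketch below uses Python's built-in sqlite3 module as a stand-in for Hive. The table and column names are invented for the example. The SELECT statement is ordinary SQL of the kind Hive's dialect also accepts, while the CREATE TABLE line uses SQLite column types (Hive would use types such as STRING and INT).

```python
import sqlite3


def top_pages(rows):
    """Count views per page with a SQL query; sqlite3 stands in for Hive here."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table; in Hive the column types would be STRING and INT.
    conn.execute("CREATE TABLE page_views (page TEXT, user_id INTEGER)")
    conn.executemany("INSERT INTO page_views VALUES (?, ?)", rows)
    query = """
        SELECT page, COUNT(*) AS views
        FROM page_views
        GROUP BY page
        ORDER BY views DESC, page ASC
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    sample = [("/home", 1), ("/home", 2), ("/docs", 1)]  # hypothetical data
    for page, views in top_pages(sample):
        print(page, views)
```

An analyst who can write this query against a relational database can write essentially the same text against data stored in Hadoop, which is the productivity gain described above.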

CURIOSITIES

  1. Yahoo developed the Pig language in 2006 and assigned it to the Apache Foundation in 2007.
  2. Hive's SQL-like language was developed by Facebook in 2009 so that the many programmers and analysts who already knew SQL could work with unstructured data in Hadoop.
  3. Pig and Hive seek to simplify programming for the development of MapReduce applications.
  4. Developers who do not have a broad knowledge of MapReduce can start programming very quickly using Hive or PIG.
  5. HiveQL provides a SQL-like interface for querying data stored in various databases and unstructured file systems that integrate with Hadoop.

More information about this article

Article selected from the eBook “Big Data for Executives and Market Professionals.”

eBook in English: Amazon or Apple Store

eBook in Portuguese: Amazon or Apple Store


More articles by José Antonio Ribeiro Neto
