1 Answers
- Both High-level Languages work on top of the map-reduce framework.
- Can coexist since both use the underlying HDFS and map-reduce.
language
- Pig is a procedural ; (A = load ‘mydata’; dump A)
- Hive is Declarative (select * from A)
Work Type
- Pig more suited for adhoc analysis (on-demand analysis of click stream search logs)
- Hive a reporting tool (e.g. weekly BI reporting)
Users
- Pig – Researchers, Programmers (build complex data pipelines, machine learning)
- Hive – Business Analysts
Integration
- Pig – Doesn’t have a thrift server(i.e no/limited cross-language support)
- Hive – Thrift server
User’s need
- Pig – Better dev environments, debuggers expected
- Hive – Better integration with technologies expected(e.g JDBC, ODBC)