


Programming Hive – Data Warehouse and Query Language for Hadoop, published by O’Reilly, is a comprehensive book on Apache Hive, Hadoop’s Data Warehouse Infrastructure, using Hive’s SQL dialect HiveQL in summarizing, querying, and analysing large datasets stored in Hadoop’s distributed file system. The book comprises of a large number of examples and case studies on Hadoop and MapReduce and how Hive works within the Hadoop ecosystem. Hive could also be used to create, drop and alter databases and tables, functions, views and indexes. Data formats and storage options can be customized. This book serves as a massive guide for those who are ardent programmers of Hadoop and Data Warehouse. The book is compiled by Edward Capriolo, Dean Wampler and Jason Rutherglen.
About O’Reilly
O’Reilly is a renowned media house which has been publishing technology books, tech conferences, IT courses and news, online services, magazines and research since the year 1978. Some of the books published by O’Reilly are Lean UX Workshop, Software Architecture Fundamentals Part 2: Taking a Deeper Dive, Bioinformatics Data Skills, Head First JavaScript Programming and Python Networking Programming Cookbook.
| Imprint |
|
Good one for Hadoopers
Madhu Vadlamani
Certified Buyer, Pune
Feb, 2014