by Dean Wampler (Author), Edward Capriolo (Author), Edward Rutherglen (Author), Jason Rutherglen (Author)
Hive makes life much easier for developers who work with stored and managed data in Hadoop clusters, such as data warehouses. With this example-driven guide, you'll learn how to use the Hive infrastructure to provide data summarization, query, and analysis - particularly with HiveQL, the query language dialect of SQL. You'll learn how to set up Hive in your environment and optimize its use, and how it interoperates with other tools, such as HBase. You'll also learn how to extend Hive with custom code written in Java or scripting languages. Ideal for developers with prior SQL experience, this book shows you how Hive simplifies many tasks that would be much harder to implement in the lower-level MapReduce API provided by Hadoop.
Format: Paperback
Pages: 352
Edition: 1
Publisher: O′Reilly
Published: 06 Oct 2012
ISBN 10: 1449319335
ISBN 13: 9781449319335