Best Hive Online Courses
Prerequisites: Hive requires knowledge of SQL. The course includes an SQL primer at the end. Please do that first if you don’t know SQL. You’ll need to know Java if you want to follow the sections on custom functions.
Taught by a four-person team including two Stanford-educated ex-Googlers and two ex-Flipkart lead analysts. This team has decades of practical experience in working with large-scale data.
Hive is like a new friend with an old face (SQL). This course is an end-to-end, practical guide to using Hive for Big Data processing.
Let’s parse that:
A new friend with an old face: Hive helps you leverage the power of distributed computing and Hadoop for analytical processing. Its interface is like an old friend: the very SQL-like HiveQL. This course will fill in all the gaps between SQL and what you need to use Hive.
End-to-End: The course is an end-to-end guide to using Hive. Whether you are an analyst who wants to process data or an engineer who needs to build custom functionality or optimize performance, everything you’ll need is right here. New to SQL? No need to look elsewhere: the course has a primer on all the basic SQL constructs.
Practical: Everything is taught using real-life examples, working queries and code.
Analytical Processing: Joins, Subqueries, Views, Table Generating Functions, Explode, Lateral View, Windowing and more
Tuning Hive for better functionality: Partitioning, Bucketing, Join Optimizations, Map Side Joins, Indexes. Writing custom user-defined functions in Java (UDF, UDAF, GenericUDF, GenericUDTF) and custom functions in Python. Implementation of MapReduce for Select, Group By and Join.
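To give a flavour of the "custom functions in Python" topic, here is a minimal sketch of the kind of script Hive can call through its TRANSFORM clause (for example `SELECT TRANSFORM(name, amount) USING 'python upper_name.py' AS (name, amount) FROM some_table`). Hive streams each row to the script as tab-separated text and reads tab-separated lines back. The script name, table name, column names and sample rows are all hypothetical, and this is not taken from the course material.

```python
# Sketch of a Hive streaming script: upper-case the first column of each row.
# Assumption: rows arrive as tab-separated text, with NULL written as \N
# (the default text serialization).

NULL_MARKER = "\\N"


def transform_line(line):
    """Return the row with its first column upper-cased (NULLs left alone)."""
    fields = line.rstrip("\n").split("\t")
    if fields[0] != NULL_MARKER:
        fields[0] = fields[0].strip().upper()
    return "\t".join(fields)


def transform_rows(lines):
    """Apply transform_line to every incoming line, one at a time."""
    for line in lines:
        yield transform_line(line)


# Hypothetical sample input. When run by Hive, `sample` would be replaced
# by sys.stdin (after `import sys`) so rows are read from the query.
sample = ["alice\t10\n", "\\N\t20\n"]
for out in transform_rows(sample):
    print(out)
```

Keeping the per-row logic in its own function makes it easy to test locally before the script is shipped to the cluster with `ADD FILE`.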
Apache Hive is a data processing tool on Hadoop. It is a querying tool for data stored in HDFS, and the syntax of its queries is very similar to standard SQL. Hive is open-source software that lets programmers analyze large data sets on Hadoop.
Benefits of this course:
“Basic Hive is not sufficient if you want to work on real-world projects.”
Prepare yourself to work on real-world Big Data and Hive projects by learning advanced Hive in this course. Enroll to get end-to-end knowledge of basic and advanced Hive, plus use cases that are commonly asked about in interviews. Few courses of this kind exist, and this one covers even the fine details of Hive.
In this course you will learn Hive step by step, from the very basics to the advanced features that are actually used in real-world projects, such as:
- Variables in Hive
- Table properties of Hive
- Map and Bucketed Joins
- Advanced functions in Hive
- Compression techniques in Hive
- Configuration settings of Hive
- Working with Multiple tables in Hive
- Loading Unstructured data in Hive
And many more.
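As an illustration of the "map join" item in the list above, the following is a rough Python sketch of the idea behind a map-side join: the small table is loaded into an in-memory hash table and the large table is streamed past it, so no shuffle and reduce phase is needed for the join. In Hive this is typically requested with a `/*+ MAPJOIN(t) */` hint or left to the `hive.auto.convert.join` setting. The table contents and the function name here are hypothetical, and the code only models the concept, not Hive's actual implementation.

```python
def map_side_join(small_rows, big_rows, key):
    """Inner-join two lists of dict rows on `key`, hash-join style."""
    # Build phase: index the small table by join key, held entirely in memory.
    lookup = {}
    for row in small_rows:
        lookup.setdefault(row[key], []).append(row)

    # Probe phase: stream the large table and emit one row per match.
    for big in big_rows:
        for small in lookup.get(big[key], []):
            yield {**small, **big}


# Hypothetical sample tables.
departments = [
    {"dept_id": 1, "dept": "Sales"},
    {"dept_id": 2, "dept": "Ops"},
]
employees = [
    {"emp": "Asha", "dept_id": 1},
    {"emp": "Ben", "dept_id": 3},   # no matching department, so dropped
    {"emp": "Chen", "dept_id": 2},
]

for joined_row in map_side_join(departments, employees, "dept_id"):
    print(joined_row)
```

The approach only pays off when one side of the join is small enough to fit in each mapper's memory.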
The Apache Hadoop EcoSystem Hive Concept course is intended for users who want to learn about Hive. Hive enables the examination of huge data sets using a SQL-like language, which means anyone who can write SQL queries can access data stored on the Hadoop cluster. The course introduces the functionality of Hive, as well as its various applications for data analysis and data warehousing. It will give you a proper understanding of Apache Hive concepts from basic to advanced, such as what Hive is, Hive data types, commands, partitioning and bucketing.
After this course, students and professionals will have thorough, practical knowledge of these topics, including the advanced concepts, and will be able to work with Hive confidently.
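For readers new to the partitioning and bucketing concepts mentioned in this course description, here is a small Python sketch of the underlying idea. A partitioned table keeps each partition in its own `column=value` subdirectory, and a bucketed table assigns each row to one of a fixed number of buckets based on a hash of the bucketing column. The warehouse path and column names are hypothetical, and the exact hash arithmetic is an assumption made for illustration rather than a statement of Hive's internals.

```python
def partition_dir(table_dir, partition_spec):
    """Build the directory for a partition, e.g. .../country=IN/dt=2024-01-01."""
    parts = [f"{col}={val}" for col, val in partition_spec.items()]
    return "/".join([table_dir.rstrip("/")] + parts)


def bucket_for(hash_code, num_buckets):
    """Map a column hash to a bucket number in the range [0, num_buckets).

    Assumption: the sign bit is masked off before taking the modulus, so
    negative hash codes still land in a valid bucket.
    """
    return (hash_code & 0x7FFFFFFF) % num_buckets


# Hypothetical table location and partition columns.
print(partition_dir("/warehouse/sales", {"country": "IN", "dt": "2024-01-01"}))

# Assumption: for an integer column the hash is simply the value itself.
print(bucket_for(42, 4))
```

Partitioning lets a query skip whole directories when it filters on the partition column, while bucketing spreads rows evenly across a fixed number of files, which helps with sampling and bucketed joins.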
Best Hive Books:
#1 Learn Hive in 1 Day: Complete Guide to Master Apache Hive by Krishna Rungta
#2 Apache Hive Essentials by Dayong Du
#3 Practical Hive: A Guide to Hadoop’s Data Warehouse System 1st Edition by Scott Shaw & Andreas François Vermeulen & Ankur Gupta
#4 Apache Hive Cookbook by Hanish Bansal & Saurabh Chauhan & Shrey Mehrotra
#5 Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition by Dayong Du
#6 Programming Hive: Data Warehouse and Query Language for Hadoop 1st Edition by Edward Capriolo & Dean Wampler & Jason Rutherglen