HW HDP PH - Hortonworks HDP Developer Apache Pig and Hive (DEV-302)
Sunset Learning Institute
Course description
This course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Pig and Hive. Topics include Hadoop, YARN, HDFS, MapReduce, data ingestion, workflow definition, using Pig and Hive to perform data analytics on Big Data, and an introduction to Spark Core and Spark SQL.
COVID-19 Update
In light of COVID-19, this provider is now delivering some or all of their courses online. Contact them for more information!
Who should attend?
Software developers who need to understand and develop applications for Hadoop.
Training content
DAY 1 – AN INTRODUCTION TO THE HADOOP DISTRIBUTED FILE SYSTEM
OBJECTIVES
- Understanding Hadoop
- The Hadoop Distributed File System
- Ingesting Data into HDFS
- The MapReduce Framework
LABS
- Starting an HDP Cluster
- Demonstration: Understanding Block Storage
- Using HDFS Commands
- Importing RDBMS Data into HDFS
- Exporting HDFS Data to an RDBMS
- Importing Log Data into HDFS Using Flume
- Demonstration: Understanding MapReduce
- Running a MapReduce Job
DAY 2 – AN INTRODUCTION TO APACHE PIG
OBJECTIVES
- Introduction to Apache Pig
- Advanced Apache Pig Programming
LABS
- Demonstration: Understanding Apache Pig
- Getting Started with Apache Pig
- Exploring Data with Apache Pig
- Splitting a Dataset
- Joining Datasets with Apache Pig
- Preparing Data for Apache Hive
- Demonstration: Computing Page Rank
- Analyzing Clickstream Data
- Analyzing Stock Market Data Using Quantiles
DAY 3 – AN INTRODUCTION TO APACHE HIVE
OBJECTIVES
- Apache Hive Programming
- Using HCatalog
- Advanced Apache Hive Programming
LABS
- Understanding Hive Tables
- Understanding Partition and Skew
- Analyzing Big Data with Apache Hive
- Demonstration: Computing NGrams
- Joining Datasets in Apache Hive
- Computing NGrams of Emails in Avro Format
- Using HCatalog with Apache Pig
DAY 4 – WORKING WITH SPARK CORE, SPARK SQL AND OOZIE
OBJECTIVES
- Advanced Apache Hive Programming (Continued)
- Hadoop 2 and YARN
- Introduction to Spark Core and Spark SQL
- Defining Workflow with Oozie
LABS
- Advanced Apache Hive Programming
- Running a YARN Application
- Getting Started with Apache Spark
- Exploring Apache Spark SQL
- Defining an Apache Oozie Workflow
Why choose SLI?
Award-winning instructors
Over 50 training locations across North America
All dates are guaranteed to run
About SLI

Sunset Learning Institute (SLI) has been an innovative leader in developing and delivering authorized technical training since 1996. We develop and deliver scalable technical learning solutions to our technology partners and their customers to help optimize technology investments, improve job...