Cloudera Data Analyst Training (CDAT)

 

Course Overview

Apache Hive makes multi-structured data accessible to analysts, database administrators, and others without Java programming expertise. Apache Pig applies the fundamentals of familiar scripting languages to the Hadoop cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.

Who should attend

  • Data Analysts
  • Application Developers
  • Database Programmers
  • Data Warehouse Administrators
  • System Administrators

Prerequisites

  • A basic understanding of Structured Query Language (SQL) and scripting languages used in SQL is helpful but not required
  • A basic understanding of distributed file system concepts such as clustering, MapReduce, BigTable, MetaData storage concepts are helpful but not required.

Course Objectives

Upon successful completion of this course and it's interactive hands-on exercises, you should be able to:

  • Understand the fundamentals of Apache Hadoop, data ETL (extract, transform, load), ingestion, and processing with Hadoop tools
  • Join multiple data sets and analyzing disparate data with Pig
  • Organize data into tables, perform transformations, and simplify complex queries with Hive
  • Perform real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala
  • Understand how to pick the best tool for a given task in Hadoop to achieve interoperability, and manage recurring workflows

Prices & Delivery methods

Classroom Training
Modality: G

Duration 4 days

Price
  • on request

Currently there are no training dates scheduled for this course.