DENG-254: Preparing with Cloudera Data Engineering | PUE Data Training
Cloudera

DENG-254: Preparing with Cloudera Data Engineering

Course delivers the key concepts & knowledge needed to use Apache Spark to develop high-performance, parallel applications on Cloudera.

28 hours
Virtual
28 Hours
Duration
Virtual
Format
Included
Lab Access
Included
Certificate

Course Overview

Course delivers the key concepts & knowledge needed to use Apache Spark to develop high-performance, parallel applications on Cloudera.

This hands-on training course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP). 

Hands-on exercises allow students to practice writing Spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with “big data” stored in a distributed file system.

After taking this course, participants will be prepared to face real-world challenges and build applications to execute faster decisions, better decisions, and interactive analysis, applied to a wide variety of use cases, architectures, and industries.

Course Objectives

During this course, you will learn how to:

  • Distribute, store, and process data in a CDP cluster

  • Write, configure, and deploy Apache Spark applications

  • Use the Spark interpreters and Spark applications to explore, process, and analyze distributed data

  • Query data using Spark SQL, DataFrames, and Hive tables

  • Deploy a Spark application on the Data Engineering Service

Available Options

English €2,970.00
Italian €2,685.00
Spanish €1,840.00

Upcoming Sessions

0 sessions available

No Sessions Scheduled

There are currently no upcoming sessions scheduled for this course.

Request Course Information

Have Questions About This Course?

Contact Our Training Team