
DENG-259: Building Solutions with Cloudera Data Services
This three-day course provides participants with a comprehensive understanding of the Cloudera platform and its integrated services, including Cloudera Data Warehouse, Cloudera Data Engineering, Cloudera Data Flow, and Cloudera AI.
Participants will gain hands-on experience in designing, implementing, and optimizing data workflows and analytics solutions within the Cloudera ecosystem. The course emphasizes practical strategies for building scalable, secure, and efficient data-driven solutions tailored to enterprise needs. Key topics include data ingestion and processing, stream management, query optimization, machine learning integration, and managing resource performance in production environments.
Cloudera Data FlowIntroduction to data ingestion and streaming capabilitiesOverview of NiFi, Kafka, and stream processingHands-on Session: Creating and managing data flowsCloudera Data EngineeringIntroduction to Cloudera Data Engineering and AirflowTroubleshooting jobs and reviewing use casesHands-on Session: Building Airflow DAGsCloudera Data WarehouseUnderstanding Cloudera Data Warehouse for large- scale data analyticsIntroduction to IcebergHands-on Session: Building a data lakehousePerformance optimization and lakehouse maintenanceData visualizationCloudera AI & Machine LearningIntroduction to Cloudera Machine LearningAutomating ML workflows and deploying models at scaleHands-on Session: Training and deploying a model using Cloudera AIMLOps pipeline and model monitoringWorkshop: Stock Market Analysis with Alpha VantageParticipants will use Alpha Vantage APIs to fetch and analyze stock market data.Data Ingestion and Streaming: Using Cloudera Data Flow and Cloudera Data Engineering to process real-time stock data.Global Data Access: Storing and querying stock data with Cloudera Data Warehouse.Data Visualization: Leveraging Cloudera Data Visualization to create insightful dashboards and reports.
This course is designed for data engineers, data analysts, application developers, and machine learning engineers who want a deeper understanding of how the Cloudera platform and its data services support solution development. This course assumes a foundational knowledge of data engineering principles (e.g., ETL concepts, data warehousing), analytics concepts (e.g., basic statistical analysis, data visualization), and cloud services (e.g., basic cloud computing models, service deployment). Basic familiarity with Linux environments (e.g., navigating the file system, using basic commands) and SQL (e.g., writing basic queries, understanding relational database concepts) is required. While some programming experience is helpful, this course focuses on practical application and does not require extensive coding skills. Prior experience with ETL, big data, and streaming technologies will greatly benefit participants.



