Pentaho Data Integration Fundamentals

Код
DI1000
Продолжительность
3 дн.
Ближайшие даты
с 30.03.2020 по 01.04.2020,  город: VILT
Цена
1,920Partners
2,700Customers
О курсе

This course introduces the Pentaho Data Integration (PDI) platform. It covers the basic functions, explains the capabilities of PDI, and describes the best practices to use it successfully. Course demonstrations, combined with practice, prepare you to use PDI for real world cases. Additional benefits are attained because you practice concepts learned in a PDI development environment during the course.

Для кого этот курс
  • ETL Developers
  • Data Analyst
Цель

When you complete this course, you should be able to:

  • Describe the Pentaho Data Integration (PDI) Platform and its components and their common uses
  • List the parts of transformations and describe how they execute
  • Create, preview, run, and troubleshoot a transformation using best practices and modular design principles
  • Read and write data to and from various file formats
  • Perform calculations, merges, and lookups
  • Use the PDI enterprise repository, scheduling, and monitoring capabilities
  • Log execution metrics to database tables
Требуемые знания и навыки

Experience in ETL concepts is Preferred

Приобретаемые навыки

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights. The complete data integration platform delivers accurate, “analytics ready” data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users alike.

Программа
  • Introduction to Pentaho Data Integration
    • Objectives and Class Logistics
    • Pentaho Platform and Architecture
  • Transformations
    • Transformation Concepts
    • Learning the PDI User Interface
    • Creating and Running Transformations
    • Introduction to Repositories
  • Reading and Writing Files
    • Input and Output Steps
    • PDI’s Home Directory
    • Parameterization
  • Working with Databases
    • Connecting to and Exploring a Database
    • Table Input and Output Steps
    • Insert / Update and Delete Steps
    • Filtering and Sorting Data
    • Variables and Unnamed Parameters in SQL
  • Data Flow and Lookups
    • Data Movement and Step Copies
    • Lookups and Merge
  • Calculations
    • Grouping
    • Calculation and Scripting Steps
  • Jobs Orchestration
    • Introduction to Jobs
    • Explore Common Job Entries
  • Exploring the Pentaho Repository
    • The Pentaho Repository
  • Scheduling and Monitoring
    • Setting up the Scheduler
    • Monitoring Scheduled Tasks
  • Logging
    • Introduction to Logging
    • File-based logging
    • Logging Execution Metrics to Databases