Pentaho Data Integration Fundamentals
Код: DI1000
Продолжительность очно: 3 дн.
Цена:
1,920$ - Partners
2,700$ - Customers
О курсе
This course introduces the Pentaho Data Integration (PDI) platform. It covers the basic functions, explains the capabilities of PDI, and describes the best practices to use it successfully. Course demonstrations, combined with practice, prepare you to use PDI for real world cases. Additional benefits are attained because you practice concepts learned in a PDI development environment during the course.
Для кого этот курс
- ETL Developers
- Data Analyst
Требуемые знания и навыки
Experience in ETL concepts is Preferred
Приобретаемые навыки
Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights. The complete data integration platform delivers accurate, “analytics ready” data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users alike.
Программа
- Introduction to Pentaho Data Integration
- Objectives and Class Logistics
- Pentaho Platform and Architecture
- Transformations
- Transformation Concepts
- Learning the PDI User Interface
- Creating and Running Transformations
- Introduction to Repositories
- Reading and Writing Files
- Input and Output Steps
- PDI’s Home Directory
- Parameterization
- Working with Databases
- Connecting to and Exploring a Database
- Table Input and Output Steps
- Insert / Update and Delete Steps
- Filtering and Sorting Data
- Variables and Unnamed Parameters in SQL
- Data Flow and Lookups
- Data Movement and Step Copies
- Lookups and Merge
- Calculations
- Grouping
- Calculation and Scripting Steps
- Jobs Orchestration
- Introduction to Jobs
- Explore Common Job Entries
- Exploring the Pentaho Repository
- The Pentaho Repository
- Scheduling and Monitoring
- Setting up the Scheduler
- Monitoring Scheduled Tasks
- Logging
- Introduction to Logging
- File-based logging
- Logging Execution Metrics to Databases