Day 1: Basics of ETL and Apache Airflow
Introduction to ETL – discussion of ETL processes, their importance in data processing, and an overview of available tools
Basics of Apache Airflow – installation, configuration, and creating the first Directed Acyclic Graphs (DAGs)
Designing data pipelines – best practices, modeling, and implementing simple ETL processes in Airflow
Day 2: Advanced Techniques and Talend
Advanced operations in Apache Airflow – using operators, monitoring, and debugging data pipelines
Introduction to Talend – installation, configuration, and creating ETL processes using the graphical interface
Integration and optimization – best practices for advanced data modeling and integrating ETL processes with various data sources
Features
Description
Company Description
InfoShare Academy is a leading IT academy offering comprehensive educational programs in new technologies for companies. Since 2015, we have supported organizations in developing technology teams through dedicated courses in Machine Learning, DevOps, Data Engineering, Python, UX/UI Design, AWS, and Kubernetes. Our training is based on practical skills and real business cases. We collaborate with over 300 industry practitioners, ensuring that our programs are tailored to current market needs. We specialize in reskilling and upskilling employees. With us, you will build effective teams implementing new technologies that will accelerate innovation and strengthen your company's competitiveness in the market. Check out our training offerings designed for companies, created to develop your employees' competencies in the IT field.
Training Description
- The "Creating Pipelines / ETL" training is a comprehensive course aimed at equipping participants with the skills necessary to design, implement, and manage ETL processes. The course focuses on the practical application of tools such as Apache Airflow and Talend, allowing participants to learn through direct experience in creating effective data pipelines. Required Technical Skills from Training Participants:
- Basic knowledge of the Python programming language
- Basic knowledge of databases and SQL
- Understanding of basic data processing concepts
Who the Training is For
This training is aimed at data analysts, data engineers, developers, and anyone who wants to learn how to create and manage ETL processes to effectively utilize data in their organizations.
Goals
Benefits
- You will learn:
- Designing and implementing effective ETL processes – you will gain the skills necessary to build effective data pipelines
- Practical work with ETL tools – you will learn to use Apache Airflow and Talend, two leading tools for managing ETL processes
- Managing complex data flows – you will master the techniques needed to monitor, debug, and optimize ETL processes
Training Program
Duration
16 h/2 days
Price Includes
- Certificate of completion
- Monthly access to the training recording (in case of online format)
- Customization of the training program to client needs