  • 09:30 - 13:00
  • Xomnia HQ
  • €400

Apache Airflow

When the number of transformations and ETL jobs grows, every team needs a tool to schedule them in a particular order. Airflow is one of the more recent open-source scheduling tools in the big data world.

During this training, we’ll dive into Apache Airflow, a highly flexible framework for developing and scheduling maintainable, complex workflows. We’ll look at how Airflow works, which use cases it suits, and how it interacts with tools like Apache Spark. We’ll also discuss some of the pitfalls. The training involves a lot of hands-on Python coding, so be prepared to get your hands dirty!
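To give a flavour of the core idea, here is a minimal sketch of running jobs "in a particular order" based on their dependencies. It uses only the Python standard library (`graphlib`, Python 3.9+) and is not Airflow's API; the task names and the `execution_order` helper are hypothetical and chosen purely for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical ETL tasks, each mapped to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "report": {"load"},
}


def execution_order(deps):
    """Return a run order in which every task comes after its dependencies."""
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
print(order)
```

A workflow scheduler builds on the same idea of a dependency graph and adds the parts that make it usable in production, such as running the graph on a schedule.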

Learning goals

  • Understand what Apache Airflow is, when to use it, and when not to use it
  • Be able to define and build your own data pipeline with Apache Airflow
  • Interact with external (big data) systems from within Apache Airflow
  • Recognise and understand some of the pitfalls that can occur when working with Apache Airflow

Requirements: Basic Python scripting knowledge, basic Apache Spark knowledge, basic Docker knowledge.

  • Data engineering
  • Data pipelines
  • Scheduling