
Xomnia is a word combining the letter ‘X’ (the unknown) and “Omnia” (Latin for everything). Our team of data scientists and big data engineers is trained to find the undefined, X, in all the relevant data sources, Omnia. This unknown X is untapped business value. Combine the X and Omnia and you get the Xomnia spirit: eager, curious and dedicated people who believe that the future is big data.



Training

  • Time: 09:30 - 13:00
  • Location: Xomnia HQ
  • Price: €400

For questions or additional information, see the contact information below.

Apache Airflow

When the number of transformations and ETL jobs grows, every team needs a tool to schedule them in a particular order. Airflow is one of the latest open-source scheduling tools in the big data world.

During this training, we’ll dive into Apache Airflow, a highly flexible framework for developing and scheduling complex yet maintainable workflows. We’ll look at how Airflow works, which use cases it suits, and how it interacts with tools like Apache Spark. We’ll also discuss some of the pitfalls. The training involves a lot of hands-on Python coding, so be prepared to get your hands dirty!
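To give a flavour of the core idea, here is a minimal sketch of running jobs in dependency order. It deliberately uses only the Python standard library (`graphlib`, Python 3.9+) rather than Airflow's own API, and the job names and the `run_in_order` helper are hypothetical, chosen purely for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical ETL jobs: each key maps to the set of jobs that must finish first.
dependencies = {
    "extract_orders": set(),
    "extract_customers": set(),
    "join_tables": {"extract_orders", "extract_customers"},
    "load_warehouse": {"join_tables"},
}


def run_in_order(deps, run_job):
    """Run each job only after all of its upstream jobs; return the order used."""
    order = []
    # static_order() yields jobs so that dependencies always come first,
    # and raises graphlib.CycleError if the graph contains a cycle.
    for job in TopologicalSorter(deps).static_order():
        run_job(job)
        order.append(job)
    return order


# Illustration only: the "job" here does nothing.
executed = run_in_order(dependencies, lambda job: None)
```

A workflow scheduler builds on this same ordering idea and adds the parts that matter in production, such as time-based triggering, retries and monitoring.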

Learning goals

  • Understand what Apache Airflow is, when to use it, and when not to use it
  • Be able to define and build your own data pipeline with Apache Airflow
  • Interact with external (big data) systems from within Apache Airflow
  • Recognise and understand some of the pitfalls that can occur when working with Apache Airflow

Requirements: Basic Python scripting knowledge, basic Apache Spark knowledge, basic Docker knowledge.

  • Big data engineering
  • Data pipelines
  • Scheduling