Xomnia is a word combining the letter ‘X’ – the unknown – and “Omnia” – Latin for everything. Our team of data scientists and big data engineers are trained to find the undefined – X – in all the relevant data sources – Omnia. This unknown – X – is untapped business value. Combining the X and Omnia you get the Xomnia spirit. Eager, curious and dedicated people, who have the belief that the future is big data.

Want to feel the Xomnia spirit? Follow us

What are you looking for?

Simply enter your keyword and we will help you find what you need.


XomniaTrainingsApache Spark
  • 09:30 - 13:00
  • Xomnia HQ
  • €800

For questions or additional information view contact information below

Apache Spark

Apache Spark is one of the most popular and powerful data processing frameworks nowadays. In this training, we’ll give you a crash course on Spark, introducing concepts of distributed data processing, the Spark RDD API, the Spark SQL API, and how you can use Spark to do distributed machine learning. We’ll also take a look at what performance problems and bottlenecks to watch for, and how you can approach resolving them.

Learning goals

  • Understand the challenges of distributed data processing
  • Understand Spark’s position in the big data ecosystem
  • Learn about Spark’s two main API’s: the RDD and the SQL API.
  • Learn how to use Spark to apply distributed, scalable machine learning
  • Understand what might go wrong when working with Spark, and how to think about resolving the problems

Requirements: Some programming knowledge and/or experience with databases.

  • Big data engineering
  • Data processing