  • 09:30 - 13:00
  • Xomnia HQ
  • €800

Apache Spark

Apache Spark is one of the most popular and powerful data processing frameworks in use today. This training is a crash course on Spark: it introduces the concepts of distributed data processing, the Spark RDD API, the Spark SQL API, and how to use Spark for distributed machine learning. We’ll also look at which performance problems and bottlenecks to watch for, and how to approach resolving them.

Learning goals

  • Understand the challenges of distributed data processing
  • Understand Spark’s position in the big data ecosystem
  • Learn about Spark’s two main APIs: the RDD API and the SQL API
  • Learn how to use Spark to apply distributed, scalable machine learning
  • Understand what can go wrong when working with Spark, and how to approach resolving those problems

Requirements: Some programming knowledge and/or experience with databases.

  • Data engineering
  • Data processing