Optimizing Apache Spark & Tuning Best Practices

28 May, 2026Amsterdam, The Netherlands

2 days
In Person
Data Engineering

As data scales up, efficiently processing data becomes more crucial. Building on our experience as one of the world’s most significant Apache Spark users, this 2-day course provides an in-depth overview of the do’s and don’ts of one of the most popular analytics engines available. 

Book this training

Book now

Looking to upskill your team(s) or organization? 

Rozaliia will gladly help you further with custom training solutions. 

Get in touch

Duration

2 days

Time

09:00 – 17:00 (GMT +2:00)

Language

English

Lunch

Included

Certification

No

Level

Professional

What will you learn?

After the training, you will be able to:

Explain what Apache Spark does under the hood.

Use best practices to write performant code.

Read and understand the query plans for your Spark applications.

Explain the Spark fundamentals, including the execution model: Driver/Executors.

Efficiently work with caching, shuffle service, and fair scheduling.

Troubleshoot optimization problems and memory issues.

Program

The trainer facilitates the content using notebooks hosted in a cloud environment. Each participant will have a Spark cluster to experiment with. 

  • Download & understand dataset used during training
  • Theory about various Spark basics and Spark UI
  • Apply optimisations in practice

This training is for you if:

You are comfortable using Spark but want to learn how optimizations can be applied to improve runtime

You want to learn how Spark works fundamentally – from text, to plan, to execution.

You are comfortable using Python.

This training is not for you if:

You don’t use Python with Spark (PySpark)

You want to learn how to transform notebook code into production-ready code (check out our Production-Ready Machine Learning course instead)

You want to learn how to use Databricks (this course is based on open-source Spark and is applicable to Databricks, but we are not covering Databricks concepts such as Jobs, Notebooks, Sharing, Repos, connectors, Databricks-Runtimes, etc.)

Why should I follow this training?

 Learn about Apache Spark, using best practices to write performant code and tweaking and debugging Spark applications. 

Grasp the Spark fundamentals, including the execution model: Driver/Executors, caching, shuffle service, and fair scheduling. 

Learn from and network with Apache Spark data experts. 

What else
should I know?

After registering for this training, you will receive a confirmation email with practical information. A week before the training, we will ask you about any dietary requirements and share literature if you need to prepare.

See you soon!

All literature and course materials are included in the price. 

After registering for this course, you will receive a confirmation email with practical information. 

Also interesting for you

View all trainings
Data Warehousing and Data Modeling

Build a solid foundation of data warehousing and modeling with this training. You’ll learn everything about data warehouse architectures, formal data modeling, performance tuning and more.

Lucy Sheppard 

Data Engineering
Data Modeling
data warehousing
2 days
Virtual

Next:

23 – 23 Apr, 2026

From:

€670

View training
Professional Scrum Master – AI Essentials (PSM-AI)

Sjoerd Nijland

ai
Scrum
Scrum Master
1 day
In Person

Next:

30 Mar, 2026

From:

€925

View training
AI Bootcamp for Business

Master the fundamentals of Generative AI, explore the power of Agentic AI, and become a certified Analytics Translator — all in one structured, high-impact program.

Lysanne van Beek

5.5 days
Virtual

Next:

8 Apr, 2026

From:

€2975

View training
Communicate Like a CTO

Amplify your impact and learn how CTOs communicate with less-technical audiences. In one day, you’ll gain practical tools to explain technical ideas clearly, speak the language of business, and influence stakeholders as a trusted engineer or leader.

Patrick Kua

Agile Leadership
Leadership
techtraining
1 day
In Person

Next:

22 Apr, 2026

From:

€925

View training
Agentic AI for Business

Through demos, guided exercises, and collaborative activities, you’ll explore how agents make decisions, where they can add value, and what risks and limitations to watch for. You’ll leave with practical ideas for applying agentic AI in your own team or organization.

Lucy Sheppard 

2 days
In Person

Next:

26 – 27 May, 2026

From:

€1450

View training

Frequently Asked Questions