1. Vacatures
  2. Technische Universiteit Delft (TUD)
  3. 2 PhD Positions on Reinforcement Learning in the Real World

Helaas, deze vacature staat inmiddels niet meer online

Kijk gerust verder naar andere vacatures.

2 PhD Positions on Reinforcement Learning in the Real World

Do you have what it takes to push reinforcement learning beyond the realm of games?

2 maanden geleden


Mekelweg, Delft, Zuid-Holland
Tijdelijk contract / Tijdelijke opdracht
Uren per week:
36 - 40 uur
€ 2395 - € 3061 per maand


We are seeking 2 PhD candidates focusing on reinforcement learning in the recently established Mercury Machine Learning Lab (MMLL). In this lab, researchers from the University of Amsterdam (UvA) and Delft University of Technology (TU Delft) will be working together with data scientists from Booking.com to develop more usable reinforcement learning and other machine learning techniques.

Reinforcement learning (RL) is a promising approach to learn to control decision making problems that extend over time, but so far applications have been largely limited to synthetic settings such as games. Motivated by real-world problems faced in industry, we will investigate fundamental problems in reinforcement learning. For instance, we will study effective exploration in non-stationary environments and learning using many parallel trials. The candidates will be jointly supervised by Dr. M.T.J. Spaan and Dr. F.A. Oliehoek.

The MMLL collaboration provides the unique opportunity to test AI techniques in the real world, allowing new machine learning methods to be safely developed for wide application, for example in mobility, energy or healthcare. In addition to the existing researchers, the Mercury Machine Learning Lab will comprise six PhD candidates and two postdocs who will work on six different projects related to bias and generalisation problems over the course of the next five years. More details on the contents of the MMLL research projects are provided on the MMLL webpage


The ideal candidate:

  • has a Master’s degree in computer science, math, or physics.
  • has excellent math skills.
  • has a thorough understanding of the basics of reinforcement learning
  • took one or more machine learning courses
  • has strong coding skills and experience with deep learning frameworks
  • is fluent in English
  • is a strong communicator, and can work well in a team, and willing to interact with both business and academic advisors
  • is self-motivated to do cutting-edge research.


TU Delft offers PhD-candidates a 4-year contract, with an official go/no go progress assessment after one year. Salary and benefits are in accordance with the Collective Labour Agreement for Dutch Universities, increasing from € 2395 per month in the first year to € 3061 in the fourth year. As a PhD candidate you will be enrolled in the TU Delft Graduate School. The TU Delft Graduate School provides an inspiring research environment with an excellent team of supervisors, academic staff and a mentor. The Doctoral Education Programme is aimed at developing your transferable, discipline-related and research skills.

The TU Delft offers a customisable compensation package, discounts on health insurance and sport memberships, and a monthly work costs contribution. Flexible work schedules can be arranged. For international applicants we offer the Coming to Delft Service and Partner Career Advice to assist you with your relocation.

Additional information

For information about this vacancy, please contact Dr. Matthijs Spaan <m.t.j.spaan@tudelft.nl> and Dr. Frans Oliehoek , who will be the supervisors for these positions.