Students can form groups of three and will be graded in an oral examination.
There are three assignments (one every two weeks) and a mini-project. The assignments cover:
- Simulated annealing, gradient descent, and genetic algorithms
- Q-learning and SARSA
- Model-based learning and policy gradient
Every student in each group has to submit all assignments on Canvas.
During the examination, the students will present one randomly chosen assignment and the mini-project.
The grade is based 50% on the assignments and 50% on the mini-project.