logo-polimi
Loading...
Degree programme
Show/Search Programme
Course Details
Print
Save Document
Course Details
Context
Academic Year 2023/2024
Name Dott. - MI (1380) Ingegneria dell'Informazione / Information Technology
Programme Year 1

Course Details
ID Code 061642
Course Title REINFORCEMENT LEARNING
Course Type MONO-DISCIPLINARY COURSE
Credits (CFU / ECTS) 5.0
Course Description Reinforcement learning deals with solving sequential decision-making problems when no (or minimal) prior information is available. Solving sequential decision-making problems means finding their optimal control policies. Using reinforcement-learning algorithms, the optimal policy is learned through the direct interaction between the agent (or controller) and the system to be controlled. The course will introduce the main modeling frameworks, analyze the most relevant reinforcement-learning techniques, and, finally, show some interesting applications of these techniques to real-world domains. 1) Models 1.1) Finite Markov Decision Processes 1.2) Continuous Markov Decision Processes 1.3) Markov Games 2) Algorithms 2.1) Dynamic Programming (Policy Iteration, Value Iteration) 2.2) Model-free algorithms for prediction (Monte Carlo, Temporal Difference) 2.3) Model-free algorithms for control (Q-learning, SARSA) 2.4) Online RL with value function approximation (GTD, TDC) 2.5) Batch RL with value function approximation (LSTD, LSPI, FQI) 2.6) Policy Search (policy gradient, natural policy gradient, PGPE) 2.7) Deep Reinforcement Learning (DQN, TRPO, PPO, DDPG, SAC) 2.8) Planning and Learning (MCTS, AlphaZero) 2.9) Multi-agent Reinforcement Learning (overview) 3) Applications 3.1) Autonomous Driving 3.2) Water Resources Management 3.3) Algorithmic Trading
Scientific-Disciplinary Sector (SSD)
SSD Code SSD Description CFU
ING-INF/05 INFORMATION PROCESSING SYSTEMS 5.0

Details
Alphabetical group Name Teaching Assignment Details
From (included) To (excluded)
A ZZZZ Restelli Marcello, Metelli Alberto Maria
manifestidott v. 1.10.0 / 1.10.0
Area Servizi ICT
24/01/2025