Department of Information and Computing Sciences


Education: Computer Science and Information Science

Course information: Computer Science and Information Science

Multi-agent learning

Website:website containing additional information
Course code:INFOMAA
Credits:7.5 ECTS
Period:period 2 (week 46 through 5, i.e., 13-11-2017 through 2-2-2018; retake week 16)
Participants:45 enrolments so far
Schedule:The official schedule can be found in Osiris.
Teachers:This is an old schedule!
Type         Day  Time         Weeks  Room       Teacher
lab session                                      student assistant WH
                                                 student assistant MB
lecture      Mon  13.15-15.00  46-51  BBG-001    Gerard Vreeswijk
                               2-4    BBG-001
             Thu  11.00-12.45  46-51  UNNIK-211
                               2-4    UNNIK-211
week 21  Thu 23-5-2019  8.30-10.30  room EDUC-BETA
week 26  Thu 27-6-2019  8.30-10.30  room EDUC-GAMMA
week 28  Thu 11-7-2019  8.30-10.30  room BBG-023 (retake exam)
Contents:This course discusses forms of machine learning that typically occur in multi-agent systems. Topics: learning and teaching, fictitious play, rational learning, no-regret learning, multi-agent reinforcement learning, evolutionary learning.

The course assumes knowledge of probability theory and game theory.
Literature:Available through the course site. Students are responsible for downloading and printing the material.
Course form:Lectures.
Exam form:Midterm / final exam (70%), two programming assignments (30%).
Minimum effort to qualify for the retake exam:Average grade of at least 4.
Description:Multi-agent learning (MAL) studies software agents that learn and adapt to the behaviour of other software agents, which themselves adapt to the behaviour of other agents. The presence of other learning agents complicates learning because it makes the environment non-stationary (the agent is learning a moving target) and non-Markovian (not only experiences from the immediate past but also earlier experiences are relevant). With adaptive agents it also becomes less beneficial merely to adapt to the behaviour of other agents, at the risk of being exploited by more steadfast agents that do not follow but instead impose their strategy on others. Important topics concerning adaptive agents include statistical learning and single-agent reinforcement learning. Important topics of MAL include (evolutionary) game theory, fictitious play, gradient dynamics, no-regret learning, multi-agent reinforcement learning (Minimax-Q, Nash-Q), leader (teacher) vs. follower (learner) adaptation, and the emergence of social conventions. Examples of domains that need robust MAL algorithms include manufacturing systems (managers of a factory coordinate to maximise their profit), distributed sensor networks (multiple sensors collaborate to perform a large-scale sensing task under strict power constraints), robot soccer, disaster rescue (robots must safely find victims as fast as possible after an earthquake) and recreational games of imperfect information such as poker. Indeed, poker and simplified forms of poker are an important topic of research in multi-agent learning.
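
To give an impression of what "learning a moving target" means, the following is a minimal sketch of fictitious play, one of the topics listed above, in the game of matching pennies. It is an illustration only and not taken from the course material: the names MATCHING_PENNIES, best_response and fictitious_play, the number of rounds, the initial counts of one observation per action, and the tie-breaking rule are all assumptions made for this example. Each player repeatedly best-responds to the opponent's empirical action counts, so each player's target keeps shifting as the other adapts.

```python
# Row player's payoffs in matching pennies; the column player receives the negative.
# (Hypothetical example game, chosen for illustration.)
MATCHING_PENNIES = [[1, -1],
                    [-1, 1]]


def best_response(values):
    """Return the index of the highest value (ties go to the lowest index)."""
    return max(range(len(values)), key=lambda a: values[a])


def fictitious_play(payoff_row, rounds=10000):
    """Run fictitious play in a two-player zero-sum game.

    Returns the empirical action frequencies of the row and column player.
    """
    n_rows, n_cols = len(payoff_row), len(payoff_row[0])
    # Assumption: start with one fictitious observation of every action.
    row_counts = [1] * n_rows
    col_counts = [1] * n_cols
    for _ in range(rounds):
        # Payoff of each row action against the column player's observed counts
        # (proportional to the expected payoff against the empirical mixture).
        row_values = [sum(payoff_row[i][j] * col_counts[j] for j in range(n_cols))
                      for i in range(n_rows)]
        # Zero-sum: the column player's payoff is the negative of the row player's.
        col_values = [-sum(payoff_row[i][j] * row_counts[i] for i in range(n_rows))
                      for j in range(n_cols)]
        # Both players choose simultaneously, then observe each other's action.
        row_action = best_response(row_values)
        col_action = best_response(col_values)
        row_counts[row_action] += 1
        col_counts[col_action] += 1
    row_total, col_total = sum(row_counts), sum(col_counts)
    return ([c / row_total for c in row_counts],
            [c / col_total for c in col_counts])


row_freq, col_freq = fictitious_play(MATCHING_PENNIES)
print("row player's empirical frequencies:   ", row_freq)
print("column player's empirical frequencies:", col_freq)
```

In this sketch the actions played keep cycling, because each player's best response changes as soon as the opponent's counts tip over, while the empirical frequencies drift towards the mixed equilibrium in which both actions are played half of the time.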