ALA 2020

9 & 10 May 2020, Auckland

News

08 May 2020: The ALA presentations are now available on underline!
4 May 2020: The preliminary program for the live events is now online! The ALA live sessions will be streamed on Twitch!
27 April 2020: We are happy to announce our invited speakers for this year, Diederik M. Roijers and Jakob Foerster!
27 April 2020: We invite all authors and participants to join our Slack workspace! Check out the program for more details.
15 April 2020: We are happy to announce that ALA will take place this year as a virtual workshop. The content will consist of a mix of pre-recorded contributions together with live Q&A sessions and invited speaker presentations.
18 March 2020: AAMAS has decided to move to a virtual only conference this year. We will delay the paper notifications until we will receive further news and instructions on how AAMAS decides to organise the workshops under these circumstances. We will make sure to provide all the information before April 1.
26 February 2020: Submissions are now closed. We received 45 submissions this year!
5 February 2020: The submission deadline has been extended to 24 February 2020 23:59 UTC!
4 February 2020: Program Committee members added
21 January 2020: We recommend authors to also append reviews received for AAMAS submissions
21 November 2019: ALA 2020 site launched

Paper #	Authors	Title
13	Ganesh Ghalme, Swapnil Dhamal, Shweta Jain, Sujit Gujar and Y Narahari	Ballooning Multi-Armed Bandits
17	Lisa Torrey	Reinforcement Learning via Reasoning from Demonstration
18	Daniel Willemsen, Hendrik Baier and Michael Kaisers	Value targets in off-policy AlphaZero: a new greedy backup
19	Pieter Libin, Arno Moonens, Timothy Verstraeten, Fabian Perez-Sanjines, Niel Hens, Philippe Lemey and Ann Nowé	Deep reinforcement learning for large-scale epidemic control
20	Timothy Verstraeten, Eugenio Bargiacchi, Pieter Libin, Jan Helsen, Diederik Roijers and Ann Nowé	Thompson Sampling for Loosely-Coupled Multi-Agent Systems: An Application to Wind Farm Control
23	João Vitor de Oliveira Barbosa, Francisco C. Santos, Francisco S. Melo, Anna Helena Reali Costa and Jaime Simão Sichman	Emergence of Cooperation in N-Person Dilemmas through Actor-Critic Reinforcement Learning
24	Panayiotis Danassis and Boi Faltings	Learning to Persist or Switch: Efficient and Fair Allocations in Large-scale Multi-agent Systems
25	Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie and Jimmy Ba	Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
28	Peter Vamplew, Cameron Foale and Richard Dazeley	A Demonstration of Issues with Value-Based Multiobjective Reinforcement Learning Under Stochastic State Transitions
38	Grigory Neustroev, Canmanie Ponnambalam, Mathijs de Weerdt and Matthijs Spaan	Interval Q-Learning: Balancing Deep and Wide Exploration
41	Yunshu Du, Garrett Warnell, Assefaw Gebremedhin, Peter Stone and Matthew E. Taylor	Work-in-progress: Corrected Self Imitation learning via Demonstrations
45	Aly Ibrahim, Anirudha Jitani, Daoud Piracha and Doina Precup	Reward Redistribution Mechanisms in Multi-agent Reinforcement Learning

Paper #

Authors

Title

Ganesh Ghalme, Swapnil Dhamal, Shweta Jain, Sujit Gujar and Y Narahari

Ballooning Multi-Armed Bandits

Lisa Torrey

Reinforcement Learning via Reasoning from Demonstration

Daniel Willemsen, Hendrik Baier and Michael Kaisers

Value targets in off-policy AlphaZero: a new greedy backup

Pieter Libin, Arno Moonens, Timothy Verstraeten, Fabian Perez-Sanjines, Niel Hens, Philippe Lemey and Ann Nowé

Deep reinforcement learning for large-scale epidemic control

Timothy Verstraeten, Eugenio Bargiacchi, Pieter Libin, Jan Helsen, Diederik Roijers and Ann Nowé

Thompson Sampling for Loosely-Coupled Multi-Agent Systems: An Application to Wind Farm Control

João Vitor de Oliveira Barbosa, Francisco C. Santos, Francisco S. Melo, Anna Helena Reali Costa and Jaime Simão Sichman

Emergence of Cooperation in N-Person Dilemmas through Actor-Critic Reinforcement Learning

Panayiotis Danassis and Boi Faltings

Learning to Persist or Switch: Efficient and Fair Allocations in Large-scale Multi-agent Systems

Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie and Jimmy Ba

Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning

Peter Vamplew, Cameron Foale and Richard Dazeley

A Demonstration of Issues with Value-Based Multiobjective Reinforcement Learning Under Stochastic State Transitions

Grigory Neustroev, Canmanie Ponnambalam, Mathijs de Weerdt and Matthijs Spaan

Interval Q-Learning: Balancing Deep and Wide Exploration

Yunshu Du, Garrett Warnell, Assefaw Gebremedhin, Peter Stone and Matthew E. Taylor

Work-in-progress: Corrected Self Imitation learning via Demonstrations

Aly Ibrahim, Anirudha Jitani, Daoud Piracha and Doina Precup

Reward Redistribution Mechanisms in Multi-agent Reinforcement Learning

Paper #	Authors	Title
1	Abhik Singla, Sindhu Padakandla and Shalabh Bhatnagar	Memory-based Deep Reinforcement Learning Method for Obstacle Avoidance in UAV
5	Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran and Harshad Khadilkar	Using Reinforcement Learning for a Large Variable-Dimensional Inventory Management Problem
8	Zerong Xi and Gita Sukthankar	Learning Correlation Functions on Mixed Data Sequences for Computer Architecture Applications
14	Swapnil Dhamal, Walid Ben-Ameur, Tijani Chahed and Eitan Altman	A Two Phase Investment Game for Competitive Opinion Dynamics in Social Networks
22	Xiangyu Liu and Ying Tan	Feudal Latent Space Exploration for Coordinated Multi-agent Reinforcement Learning
27	Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes and Hongyuan Zha	Learning to Incentivize Other Learning Agents
29	Rohit Prasad, Harshad Khadilkar and Shivaram Kalyanakrishnan	Optimising a Real-time Scheduler for Railway Lines using Policy Search
30	Paniz Behboudian, Yash Satsangi, Matthew Taylor, Anna Harutyunyan and Michael Bowling	Useful Policy Invariant Shaping from Arbitrary Advice
32	Yijie Zhang, Roxana Radulescu, Patrick Mannion, Diederik M. Roijers and Ann Nowé	Opponent Modelling using Policy Reconstruction for Multi-Objective Normal Form Games
33	Abhinav Gupta, Agnieszka Słowik, William L. Hamilton, Mateja Jamnik, Sean B. Holden and Christopher Pal	Analyzing structural priors in multi-agent communication
34	Hang Xu, Ridhima Bector and Zinovi Rabinovich	Teaching Multiple Learning Agents by Environment-Dynamics Tweaks
39	Michael Sullins and Ian Kash	Increased Optimism in Multi-Agent Policy Gradients
43	Finbarr Timbers, Edward Lockhart, Martin Schmid, Marc Lanctot and Michael Bowling	Approximate exploitability: Learning a best response in large games
44	Arjun Manoharan, Rahul Ramesh and Balaraman Ravindran	Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Paper #

Authors

Title

Abhik Singla, Sindhu Padakandla and Shalabh Bhatnagar

Memory-based Deep Reinforcement Learning Method for Obstacle Avoidance in UAV

Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran and Harshad Khadilkar

Using Reinforcement Learning for a Large Variable-Dimensional Inventory Management Problem

Zerong Xi and Gita Sukthankar

Learning Correlation Functions on Mixed Data Sequences for Computer Architecture Applications

Swapnil Dhamal, Walid Ben-Ameur, Tijani Chahed and Eitan Altman

A Two Phase Investment Game for Competitive Opinion Dynamics in Social Networks

Xiangyu Liu and Ying Tan

Feudal Latent Space Exploration for Coordinated Multi-agent Reinforcement Learning

Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes and Hongyuan Zha

Learning to Incentivize Other Learning Agents

Rohit Prasad, Harshad Khadilkar and Shivaram Kalyanakrishnan

Optimising a Real-time Scheduler for Railway Lines using Policy Search

Paniz Behboudian, Yash Satsangi, Matthew Taylor, Anna Harutyunyan and Michael Bowling

Useful Policy Invariant Shaping from Arbitrary Advice

Yijie Zhang, Roxana Radulescu, Patrick Mannion, Diederik M. Roijers and Ann Nowé

Opponent Modelling using Policy Reconstruction for Multi-Objective Normal Form Games

Abhinav Gupta, Agnieszka Słowik, William L. Hamilton, Mateja Jamnik, Sean B. Holden and Christopher Pal

Analyzing structural priors in multi-agent communication

Hang Xu, Ridhima Bector and Zinovi Rabinovich

Teaching Multiple Learning Agents by Environment-Dynamics Tweaks

Michael Sullins and Ian Kash

Increased Optimism in Multi-Agent Policy Gradients

Finbarr Timbers, Edward Lockhart, Martin Schmid, Marc Lanctot and Michael Bowling

Approximate exploitability: Learning a best response in large games

Arjun Manoharan, Rahul Ramesh and Balaraman Ravindran

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Paper #	Authors	Title
3	Budi Kurniawan, Peter Vamplew, Michael Papasimeon, Richard Dazeley and Cameron Foale	Discrete-to-Deep Supervised Policy Learning: An effective training method for neural reinforcement learning
9	Saloni Laddha and Shrisha Rao	Dynamic Interactions by Strong Influencers in Social Networks Using Opinion Propagation
11	Shripad Salsingikar and Narayan Rangaraj	Reinforcement Learning for Train Movement Planning at Railway Stations
15	Wolfram Barfuss	Infinite population evolutionary dynamics match infinite memory reinforcement learning dynamics
16	Craig Sherstan, Bilal Kartal, Pablo Hernandez-Leal and Matthew E. Taylor	Work in Progress: Temporally Extended Auxiliary Tasks
31	Conor F Hayes, Enda Howley and Patrick Mannion	Dynamic Thresholded Lexicographic Ordering
35	Tapan Shah	State Aware Principal Action Space Embedding for Centralized MARL
36	Thomy Phan, Lenz Belzner, Kyrill Schmid, Thomas Gabor, Fabian Ritz, Sebastian Feld and Claudia Linnhoff-Popien	A Distributed Policy Iteration Scheme for Cooperative Multi-Agent Policy Approximation

Paper #

Authors

Title

Budi Kurniawan, Peter Vamplew, Michael Papasimeon, Richard Dazeley and Cameron Foale

Discrete-to-Deep Supervised Policy Learning: An effective training method for neural reinforcement learning

Saloni Laddha and Shrisha Rao

Dynamic Interactions by Strong Influencers in Social Networks Using Opinion Propagation

Shripad Salsingikar and Narayan Rangaraj

Reinforcement Learning for Train Movement Planning at Railway Stations

Wolfram Barfuss

Infinite population evolutionary dynamics match infinite memory reinforcement learning dynamics

Craig Sherstan, Bilal Kartal, Pablo Hernandez-Leal and Matthew E. Taylor

Work in Progress: Temporally Extended Auxiliary Tasks

Conor F Hayes, Enda Howley and Patrick Mannion

Dynamic Thresholded Lexicographic Ordering

Tapan Shah

State Aware Principal Action Space Embedding for Centralized MARL

Thomy Phan, Lenz Belzner, Kyrill Schmid, Thomas Gabor, Fabian Ritz, Sebastian Feld and Claudia Linnhoff-Popien

A Distributed Policy Iteration Scheme for Cooperative Multi-Agent Policy Approximation

15:45 - 16:00 UTC	Welcome & Opening Remarks
16:00 - 17:00 UTC	Invited Talk: Jakob N. Foerster Self-Play and Zero-Shot Coordination in Hanabi
17:00 - 18:00 UTC	Discussion Panel Topic: Building an AI syllabus Chair: Diederik M. Roijers (HU University of Applied Sciences Utrecht, Vrije Universiteit Brussel) Panelists: Ann Nowé (Vrije Universiteit Brussel) Matt Taylor (University of Alberta) Senthil Yogamani (Valeo ) Peter Stone (University of Texas at Austin)

09:00 - 10:00 UTC	Invited Talk: Diederik M. Roijers Multi-objective decision making: why, how, and what now?
10:00 - 10:30 UTC	Awards, closing remarks and ALA 2021 Best Paper Award: Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie and Jimmy Ba, Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning

ALA 2020

ALA 2020

9 & 10 May 2020, Auckland

News

ALA 2020 - Workshop at AAMAS 2020

Important Dates

Submission Details

Journal Special Issue

Program

Accepted Papers

Long Talks

Short Talks

Spotlight

Invited Talks

Diederik M. Roijers

Jakob N. Foerster

Programe Committee

Organization

Contact

ALA 2020

9 & 10 May 2020, Auckland

News

Important Dates

Submission Details

Journal Special Issue

Program

Accepted Papers

Long Talks

Short Talks

Spotlight

Invited Talks

Diederik M. Roijers

Jakob N. Foerster

Programe Committee

Organization

Sponsorship

Contact