Wednesday, May 15 (17:15-18:15) |
Title |
Authors |
Approximation Gradient Error Variance Reduced Optimization |
Weiye Zhao |
Credulous Acceptability, Poison Games and Modal Logic |
Davide Grossi, Simon Rey |
Social Mobilization to Reposition Indiscriminately Parked Shareable Bikes |
Zelei Liu, Han Yu, Leye Wang, Liang Hu, Qiang Yang |
A Regulation Enforcement Solution for Multi-agent Reinforcement Learning |
Sun Fan-Yun, Yen-Yu Chang, Yueh-Hua Wu, Shou-De Lin |
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards
Sophisticated Opponents |
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng |
Multi-agent Path Planning with Non-constant Velocity Motion |
Ngai Meng Kou, Cheng Peng, Xiaowei Yan, Zhiyuan Yang, Heng Liu, Kai Zhou,
Haibing Zhao, Lijun Zhu, Yinghui Xu |
Complexity and Approximations in Robust Coalition Formation via Max-Min
k-Partitioning |
Anisse Ismaili, Makoto Yokoo, Noam Hazon, Sarit Kraus, Emi Watanabe |
Contradict The Machine: a Hybrid Approach to Identifying Unknown Unknowns |
Colin Vandenhof, Edith Law |
Invincible Strategies of Iterated Prisoner's Dilemma |
Shiheng Wang, Fangzhen Lin |
General-Sum Cyber Deception Games under Partial Attacker Valuation
Information |
Omkar Thakoor, Phebe Vayanos, Christopher Kiekintveld, Milind Tambe,
Haifeng Xu |
Optimising Worlds to Evaluate and Influence Reinforcement Learning Agents |
Richard Everett, Adam Cobb, Stephen Roberts, Andrew Markham |
Broken Signals in Security Games: Coordinating Patrollers and Sensors in
the Real World |
Elizabeth Bondi, Hoon Oh, Haifeng Xu, Fei Fang, Bistra Dilkina, Milind
Tambe |
Probabilistic resource-bounded alternating-time temporal logic |
Hoang Nga Nguyen, Abdur Rakib |
A Polynomial-time Fragment of Epistemic Probabilistic Argumentation |
Nico Potyka |
Bayesian-DPOP for continuous Distributed Constraint Optimization Problems |
Jeroen Fransman, Bart De Schutter, Henry Dol, Erik Theunissen, Joris Sijs |
Distributed Policy Iteration for Scalable Approximation of Cooperative
Multi-Agent Policies |
Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld,
Claudia Linnhoff-Popien |
Avoiding Social Disappointment in Elections |
Mohammad Ali Javidian, Pooyan Jamshidi, Rasoul Ramezanian |
Landmark Based Reward Shaping in Reinforcement Learning with Hidden
States |
Alper Demir, Erkin Çilden, Faruk Polat |
Cooperating in Long-term Relationships with Time-Varying Structure |
Jacob Crandall, Huy Pham |
Dynamic and intelligent control of autonomous vehicles for highway
on-ramp merge |
Zine el abidine Kherroubi, Samir Aknine, Rebiha Bacha |
MCTS-based Automated Negotiation Agent |
Cédric Buron, Zahia Guessoum, Sylvain Ductor |
The Complexity of Additive Committee Selection with Outliers |
Yongjie Yang, Jianxin Wang |
Oblivious Envy-Free Allocations of Indivisible Goods |
Hau Chan, Jing Chen, Bo Li, Xiaowei Wu |
Advice Replay Approach for Richer Knowledge Transfer in Teacher Student
Framework |
Vaibhav Gupta, Daksh Anand, Praveen Paruchuri, Balaraman Ravindran |
Proportional Representation in Elections: STV vs PAV |
Piotr Faliszewski, Piotr Skowron, Stanislaw Szufa, Nimrod Talmon |
Simple Contest Enhancers |
Michal Habani, Priel Levy, David Sarne |
Policy Networks: A Framework for Scalable Integration of Multiple
Decision-Making Models |
Kyle Wray, Shlomo Zilberstein |
Multiagent Monte Carlo Tree Search |
Nicholas Zerbel, Logan Yliniemi |
Using surrogate models to calibrate agent-based model parameters under
data scarcity |
Priscilla Avegliano, Jaime Sichman |
Learning Simulation-Based Games from Data |
Enrique Areyan Viqueira, Cyrus Cousins, Amy Greenwald, Eli Upfal |
Attention-based Deep Reinforcement Learning for Multi-view Environments |
Elaheh Barati, Xuewen Chen |
Generating an Agent Taxonomy Using Topological Data Analysis |
Samarth Swarup, Reza Rezazadegan |
Warning Time: Optimizing Strategic Signaling for Security Against
Boundedly Rational Adversaries |
Sarah Cooney, Phebe Vayanos, Thanh Nguyen, Cleotilde Gonzalez, Christian
Lebiere, Edward Cranford, Milind Tambe |
Coordination Structures Generated by Deep Reinforcement Learning in
Distributed Task Executions |
Yuki Miyashita, Toshiharu Sugawara |
Memory Based Multiagent One Shot Learning |
Shauharda Khadka, Connor Yates, Kagan Tumer |
Obvious Strategyproofness, Bounded Rationality and Approximation |
Diodato Ferraioli, Carmine Ventre |
An Optimal Rewiring Strategy for Cooperative Multiagent Social Learning |
Hongyao Tang, Jianye Hao, Li Wang, Zan Wang, Tim Baarslag |
Improving Wind Power Forecasting through Cooperation: A Case-Study on
Operating Farms |
Tanguy Esteoule, Carole Bernon, Marie-Pierre Gleizes, Morgane Barthod |
Evaluation of Optimization for Pedestrian Route Guidance in Real-world
Crowded Scene |
Shusuke Shigenaka, Shunki Takami, Masaki Onishi, Itsuki Noda, Tomohisa
Yamashita |
Cooperative Routing with Heterogeneous Vehicles |
Keisuke Otaki, Satoshi Koide, Ayano Okoso, Tomoki Nishi |
Distributed Task Assignment and Path Planning with Limited Communication
for Robot Teams |
Dario Albani, Wolfgang Hoenig, Nora Ayanian, Daniele Nardi, Vito Trianni |
How to get the most from goods donated to charities |
Christopher Culley, Ji Qi, Carmine Ventre |
Actor-Critic Algorithms for Constrained Multi-agent Reinforcement
Learning |
Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Prabuchandran
Krithivasan Jayachandran, Shalabh Bhatnagar |
Thompson Sampling Based Multi-Armed-Bandit Mechanism Using Neural
Networks |
Manisha Padala, Sujit Gujar |
Computing Stable Solutions in Threshold Network Flow Games With Bounded
Treewidth |
Aldo Pacchiano, Yoram Bachrach |
Hybrid BiLSTM-Siamese Network for Relation Extraction |
Zeyuan Cui, Shijun Liu |
Efficient City-Scale Patrolling Using Decomposition and Grafting |
Wanyuan Wang, Zichen Dong, Bo An, Yichuan Jiang |
Risk Averse Reinforcement Learning for Mixed Multi-agent Environments |
Sai Koti Reddy Danda, Amrita Saha, Srikanth Tamilselvam, Priyanka
Agrawal, Pankaj Dayama |
From Hotelling to Load Balancing: Approximation and the Principle of
Minimum Differentiation |
Matthias Feldotto, Pascal Lenzner, Louise Molitor, Alexander Skopalik |
Online Motion Concept Learning: A novel algorithm for sample-efficient
learning and recognition of human actions |
Miguel Vasco, Francisco Melo, David Martins de Matos, Ana Paiva,
Tetsunari Inamura |
Delayed and Time-Variant Patrolling Strategies against Attackers with
Local Observation Capabilities |
Carlos Diaz Alvarenga, Nicola Basilico, Stefano Carpin |
Deriving norms from actions, values and context |
Myrthe Tielman, Catholijn Jonker, M. Birna van Riemsdijk |
Rethinking the Neutrality Axiom in Judgment Aggregation |
Zoi Terzopoulou, Ulle Endriss |
Explaining Failures Propagations in the Execution of Multi-Agent Temporal
Plans |
Gianluca Torta, Roberto Micalizio, Samuele Sormano |
Logically-Constrained Neural Fitted Q-iteration |
Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening |
A Homophily-Free Community Detection Framework for Trajectories with
Delayed Responses |
Chung-Kyun Han, Shih-Fen Cheng, Pradeep Varakantham |
Stability of Human-Inspired Agent Societies |
Joe Collenette, Katie Atkinson, Daan Bloembergen, Karl Tuyls |
Deep Generative and Discriminative Domain Adaptation |
Han Zhao |
Predictive Execution Monitoring of BDI Recipes |
Mika Barkan, Gal Kaminka |
Aggregating Citizen Preferences for Public Projects Through Civic
Crowdfunding |
Sankarshan Damle, Moin Hussain Moti, Praphul Chandra, Sujit Gujar |
The Gift Exchange Game: Managing Opponent Actions |
Steven Damer, Maria Gini, Jeffrey Rosenschein |
DeepAggregation: A New Approach for Aggregating Incomplete Ranked Lists
using Multi-Layer Graph Embedding |
Rohith D Vallam, Ramasuri Narayanam, Srikanth Tamilselvam, Nicholas
Mattei, Sudhanshu Singh, Shweta Garg, Gyana Parija |
A Social Choice Perspective on Database Aggregation |
Francesco Belardinelli, Umberto Grandi |
A Privacy Preserving Multiagent System for Load Balancing in the Smart
Grid |
Shangyu Xie, Yuan Hong, Peng-Jun Wan |
Collaborative Reinforcement Learning Model for Sustainability of
Cooperation in Sequential Social Dilemmas |
Ritwik Chaudhuri, Kushal Mukherjee, Rohith D Vallam, Ayush Kumar,
Antriksh Mathur, Shweta Garg, Sudhanshu Singh, Gyana Parija, Ramasuri
Narayanam |
Interpretable Automated Machine Learning in Maana Knowledge Platform |
Fangkai Yang, Alexander Elkholy, Steven Gustafson |
Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork:
The STAR Framework |
Shani Alkoby, Avilash Rath, Peter Stone |
Power indices for team reformation planning under uncertainty |
Jonathan Cohen, Abdel-Illah Mouaddib |
The StarCraft Multi-Agent Challenge |
Mikayel Samvelyan, Tabish Rashid, Gregory Farquhar, Jakob Foerster,
Christian Schroeder de Witt, Nantas Nardelli, Tim G. J. Rudner, Chia-Man
Hung, Philip H. S. Torr, Shimon Whiteson |
Generative Adversarial Imitation from Observation |
Faraz Torabi, Garrett Warnell, Peter Stone |
Verifying Strategic Abilities in Multi-agent Systems with Private
Data-Sharing |
Francesco Belardinelli, Ioana Boureanu, Catalin Dima, Vadim Malvone |
Curriculum Learning for Tightly Coupled Multiagent Systems |
Golden Rockefeller, Patrick Mannion, Kagan Tumer |
A Compression-Inspired Framework for Macro Discovery |
Francisco Garcia, Bruno da Silva, Philip Thomas |
When to stop for safe manipulation in unstructured environments? |
Abdullah Cihan Ak, Arda Inceoglu, Sanem Sariel |
X*: Anytime Multiagent Planning With Bounded Search |
Kyle Vedder, Joydeep Biswas |
What Stands-in for a Missing Tool?: A Prototypical Grounded
Knowledge-based Approach to Tool Substitution |
Madhura Thosar, Christian Mueller, Georg Jäger, Till Mossakowski,
Sebastian Zug |
Training Cooperative Agents for Multi-Agent Reinforcement Learning |
Sushrut Bhalla, Sriram Ganapathi Subramanian, Mark Crowley |
Long-term Autonomous Mobile Manipulation under Uncertainty |
Michael Lanighan, Roderic Grupen |
Agent Software is More Complex than Other Software: An Empirical
Investigation |
Alon Zanbar, Gal Kaminka |
A Property-based Testing Framework for Multi-Agent Systems |
Lars-Åke Fredlund, Clara Benac Earle |
Removing the Target Network from Deep Q-Networks with Mellowmax Operator |
Seungchan Kim, Kavosh Asadi, Michael Littman, George Konidaris |
Modeling Human Decision-Making during Hurricanes: From Model to Data
Collection to Prediction |
Nutchanon Yongsatianchot, Stacy Marsella |
Social Power in Human-Robot Interaction: Towards more Persuasive Robots |
Mojgan Hashemian, Ana Paiva, Samuel Mascarenhas, Pedro Santos, Rui Prada |
Applying Norms and Sanctions to Promote Cybersecurity Hygiene |
Shubham Goyal, Nirav Ajmeri, Munindar Singh |
Learn a Robust Policy in Adversarial Games via Playing with an Expert
Opponent |
Jialian Li, Tongzheng Ren, Hang Su, Jun Zhu |
Smart Targets to Avoid Observation in CTO problem |
Leonardo Ferreira da Costa, Thayanne Franca da Silva, Jose Luis Alves
Leite, Raimundo Juracy Campos Ferro Junior, Raphael Pinheiro de Souza, Joao
Pedro Bernardino Andrade, Gustavo Augusto Lima de Campos |
The unbroken telephone game: keeping swarms connected |
Vivek Shankar Varadharajan, Bram Adams, Giovanni Beltrame |
Optimal Risk in Multiagent Blind Tournaments |
Theodore Perkins |
To be Big Picture Thinker or Detail-Oriented? Utilizing Perceived Gist
Information to Achieve Efficient Convention Emergence with Bilateralism and
Multilateralism |
Shuyue Hu, Chin-wing Leung, Ho-fung Leung, Jiamou Liu |
The DARPA SocialSim Challenge: Massive Multi-Agent Simulations of the
Github Ecosystem |
Jim Blythe, Emilio Ferrara, Diana Huang, Kristina Lerman, Goran Muric,
Anna Sapienza, Alexey Tregubov, Diogo Pacheco, John Bollenbacher, Alessandro
Flammini, Pik-Mai Hui, Fil Menczer |
Active Learning with Gaussian Processes for High Throughput Phenotyping |
Sumit Kumar, Wenhao Luo, George Kantor, Katia Sycara |
Escape Room: A Configurable Testbed for Hierarchical Reinforcement
Learning |
Jacob Menashe, Peter Stone |
Bribery in Balanced Knockout Tournaments |
Christine Konicki, Virginia Vassilevska Williams |
Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains |
Jim Martin Catacora Ocana, Francesco Riccio, Roberto Capobianco, Daniele
Nardi |
Explicable Planning as Minimizing Distance from Expected Behavior |
Anagha Kulkarni, Yantian Zha, Tathagata Chakraborti, Satya Gautam
Vadlamudi, Yu Zhang, Subbarao Kambhampati |