Approximation Gradient Error
Variance Reduced Optimization |
Weiye Zhao |
Credulous Acceptability,
Poison Games and Modal Logic |
Davide Grossi, Simon Rey |
Learning Efficient
Communication in Cooperative Multi-Agent Environment |
Yuhang Zhao, Xiujun Ma |
Optimal Bribery in Voting |
Palash Dey |
Coordinating Sacrifices to
Enhance Social Welfare in Multi-agent Systems |
Han Yu, Zhiqi Shen, Lizhen Cui, Yongqing Zheng, Victor
Lesser |
Social Mobilization to
Reposition Indiscriminately Parked Shareable Bikes |
Zelei Liu, Han Yu, Leye Wang, Liang Hu, Qiang Yang |
A Regulation Enforcement
Solution for Multi-agent Reinforcement Learning |
Sun Fan-Yun, Yen-Yu Chang, Yueh-Hua Wu, Shou-De Lin |
Bayes-ToMoP: A Fast Detection
and Best Response Algorithm Towards Sophisticated Opponents |
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan
Zheng |
Multi-agent Path Planning
with Non-constant Velocity Motion |
Ngai Meng Kou, Cheng Peng, Xiaowei Yan, Zhiyuan Yang, Heng
Liu, Kai Zhou, Haibing Zhao, Lijun Zhu, Yinghui Xu |
Installing Resilience in
Distributed Constraint Optimization Operated by Physical Multi-Agent Systems |
Pierre Rust, Gauthier Picard, Fano Ramparany |
Student-Project-Resource
Matching-Allocation Problems: Two-Sided Matching Meets Resource Allocation |
Anisse Ismaili, Kentaro Yahiro, Makoto Yokoo, Tomoaki
Yamaguchi |
Complexity and Approximations
in Robust Coalition Formation via Max-Min k-Partitioning |
Anisse Ismaili, Makoto Yokoo, Noam Hazon, Sarit Kraus, Emi
Watanabe |
Contradict The Machine: a
Hybrid Approach to Identifying Unknown Unknowns |
Colin Vandenhof, Edith Law |
Invincible Strategies of
Iterated Prisoner's Dilemma |
Shiheng Wang, Fangzhen Lin |
An Urgency-Dependent Quorum
Sensing Algorithm for N-Site Selection in Autonomous Swarms |
Grace Cai, Don Sofge |
General-Sum Cyber Deception
Games under Partial Attacker Valuation Information |
Omkar Thakoor, Phebe Vayanos, Christopher Kiekintveld, Milind
Tambe, Haifeng Xu |
The Representational Capacity
of Action-Value Networks for Multi-Agent Reinforcement Learning |
Jacopo Castellini, Frans Oliehoek, Rahul Savani, Shimon
Whiteson |
Simple Contrapositive
Assumption-Based Frameworks |
Ofer Arieli, Jesse Heyninck |
Optimising Worlds to Evaluate
and Influence Reinforcement Learning Agents |
Richard Everett, Adam Cobb, Stephen Roberts, Andrew
Markham |
Broken Signals in Security
Games: Coordinating Patrollers and Sensors in the Real World |
Elizabeth Bondi, Hoon Oh, Haifeng Xu, Fei Fang, Bistra
Dilkina, Milind Tambe |
Probabilistic
resource-bounded alternating-time temporal logic |
Hoang Nga Nguyen, Abdur Rakib |
Fair Division of Indivisible
Goods Among Strategic Agents |
Siddharth Barman, Ganesh Ghalme, Shivika Narang, Shweta Jain,
Pooja Kulkarni |
A Polynomial-time Fragment of
Epistemic Probabilistic Argumentation |
Nico Potyka |
Polynomial-Time Multi-Agent
Pathfinding with Heterogeneous and Self-Interested Agents |
Manao Machida |
Facility Location for Three
Agents |
Reshef Meir |
Distributed Policy Iteration
for Scalable Approximation of Cooperative Multi-Agent Policies |
Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor,
Sebastian Feld, Claudia Linnhoff-Popien |
Learning Factored Markov
Decision Processes with Unawareness |
Craig Innes, Alex Lascarides |
Reachability and Coverage
Planning for Connected Agents |
Tristan Charrier, Arthur Queffelec, Ocan Sankur, Francois
Schwarzentruber |
The Complexity of the
Possible Winner Problem with Partitioned Preferences |
Batya Kenig |
Avoiding Social
Disappointment in Elections |
Mohammad Ali Javidian, Pooyan Jamshidi, Rasoul Ramezanian |
A Q-values Sharing Framework
for Multiple Independent Q-learners |
Changxi Zhu, Ho-fung Leung, Shuyue Hu, Yi Cai |
Multiagent Adversarial
Inverse Reinforcement Learning |
Ermo Wei, Drew Wicke, Sean Luke |
Landmark Based Reward Shaping
in Reinforcement Learning with Hidden States |
Alper Demir, Erkin Çilden, Faruk Polat |
Personality-Based
Representations of Imperfect-Recall Games |
Andrea Celli, Nicola Gatti, Giulia Romano |
Generating Voting Rules from
Random Relations |
Nic Wilson |
Multi-Agent Hierarchical
Reinforcement Learning with Dynamic Termination |
Dongge Han, Wendelin Boehmer, Michael Wooldridge, Alex
Rogers |
Dynamic Trip-Vehicle Dispatch
with Scheduled and On-Demand Requests |
Taoan Huang, Bohui Fang, Hoon Oh, Xiaohui Bei, Fei Fang |
Cooperating in Long-term
Relationships with Time-Varying Structure |
Jacob Crandall, Huy Pham |
Regular Decision Processes: Modelling Dynamic Systems without Using Hidden Variables |
Ronen Brafman, Giuseppe De Giacomo |
On Enactability of Agent
Interaction Protocols: Towards a Unified Approach |
Angelo Ferrando, Michael Winikoff, Stephen Cranefield, Viviana
Mascardi, Frank Dignum |
MARL-PPS: Multi-agent
Reinforcement Learning with Periodic Parameter Sharing |
Safa Cicek, Alireza Nakhaei, Stefano Soatto, Kikuo
Fujimura |
A New Constraint Satisfaction
Perspective on Multi-Agent Path Finding |
Jiangxing Wang, Jiaoyang Li, Hang Ma, Sven Koenig, T. K.
Satish Kumar |
Entailment Functions and
Reasoning Under Inconsistency |
Yakoub Salhi |
Vote for Me! Election Control
via Social Influence in Arbitrary Scoring Rule Voting Systems |
Federico Corò, Emilio Cruciani, Gianlorenzo D'Angelo, Stefano
Ponziani |
Coordinated Multiagent
Reinforcement Learning for Teams of Mobile Sensing Robots |
Chao Yu, Xin Wang, Zhanbo Feng |
Learning through Probing: a
decentralized reinforcement learning architecture for social dilemmas |
Nicolas Anastassacos, Mirco Musolesi |
MCTS-based Automated
Negotiation Agent |
Cédric Buron, Zahia Guessoum, Sylvain Ductor |
Towards a “Master Algorithm”
for Forming Faster Conventions On Various Networks |
Mohammad Hasan |
The Complexity of Additive
Committee Selection with Outliers |
Yongjie Yang, Jianxin Wang |
Maximin-Aware Allocations of Indivisible Goods |
Hau Chan, Jing Chen, Bo Li, Xiaowei Wu |
Advice Replay Approach for
Richer Knowledge Transfer in Teacher Student Framework |
Vaibhav Gupta, Daksh Anand, Praveen Paruchuri, Balaraman
Ravindran |
Proportional Representation
in Elections: STV vs PAV |
Piotr Faliszewski, Piotr Skowron, Stanislaw Szufa, Nimrod
Talmon |
Simple Contest Enhancers |
Michal Habani, Priel Levy, David Sarne |
Temporal Information Design
in Contests |
Priel Levy, David Sarne, Yonatan Aumann |
Policy Networks: A Framework
for Scalable Integration of Multiple Decision-Making Models |
Kyle Wray, Shlomo Zilberstein |
Learning Self-Game-Play
Agents for Combinatorial Optimization Problems |
Ruiyang Xu, Karl Lieberherr |
Multiagent Monte Carlo Tree
Search |
Nicholas Zerbel, Logan Yliniemi |
Using surrogate models to
calibrate agent-based model parameters under data scarcity |
Priscilla Avegliano, Jaime Sichman |
Learning Simulation-Based
Games from Data |
Enrique Areyan Viqueira, Cyrus Cousins, Amy Greenwald, Eli
Upfal |
Maxmin Share Fair Allocation
of Indivisible Chores to Asymmetric Agents |
Haris Aziz, Hau Chan, Bo Li |
Modeling Random Guessing and
Task Difficulty for Truth Discovery in Crowdsourcing |
Yi Yang, Quan Bai, Qing Liu |
Attention-based Deep
Reinforcement Learning for Multi-view Environments |
Elaheh Barati, Xuewen Chen |
Generating an Agent Taxonomy
Using Topological Data Analysis |
Samarth Swarup, Reza Rezazadegan |
Warning Time: Optimizing
Strategic Signaling for Security Against Boundedly Rational Adversaries |
Sarah Cooney, Phebe Vayanos, Thanh Nguyen, Cleotilde Gonzalez,
Christian Lebiere, Edward Cranford, Milind Tambe |
Optimal Sequential Planning
for Communicative Actions: A Bayesian Approach |
Piotr Gmytrasiewicz, Sarit Adhikari |
Coordination Structures
Generated by Deep Reinforcement Learning in Distributed Task Executions |
Yuki Miyashita, Toshiharu Sugawara |
Memory Based Multiagent One
Shot Learning |
Shauharda Khadka, Connor Yates, Kagan Tumer |
Robustness against Agent
Failure in Hedonic Games |
Ayumi Igarashi, Kazunori Ota, Yuko Sakurai, Makoto Yokoo |
Obvious Strategyproofness,
Bounded Rationality and Approximation |
Diodato Ferraioli, Carmine Ventre |
An Optimal Rewiring Strategy
for Cooperative Multiagent Social Learning |
Hongyao Tang, Jianye Hao, Li Wang, Zan Wang, Tim Baarslag |
A dynamic aleatoric calculus
for reasoning in games of bluffing and chance |
Tim French, Andrew Gozzard, Mark Reynolds |
A Truthful,
Privacy-Preserving, Approximately Efficient Combinatorial Auction For
Single-minded Bidders |
Sankarshan Damle, Boi Faltings, Sujit Gujar |
Cooperative Routing with
Heterogeneous Vehicles |
Keisuke Otaki, Satoshi Koide, Ayano Okoso, Tomoki Nishi |
On the maximization of
influence over an unknown social network |
Kexiu Song, Jiamou Liu, Bo Yan, Yiping Liu, Hongyi Su, Hong
Zheng |
How to get the most from
goods donated to charities |
Christopher Culley, Ji Qi, Carmine Ventre |
Actor-Critic Algorithms for
Constrained Multi-agent Reinforcement Learning |
Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda,
Prabuchandran Krithivasan Jayachandran, Shalabh Bhatnagar |
Meta-Strategy for Multi-Time
Negotiation: A Multi-Armed Bandit Approach |
Ryohei Kawata, Katsuhide Fujita |
Stackelberg Equilibrium
approximation in general-sum extensive-form games with double-oracle sampling
method |
Jan Karwowski, Jacek Mańdziuk |
Thompson Sampling Based
Multi-Armed-Bandit Mechanism Using Neural Networks |
Manisha Padala, Sujit Gujar |
Computing Stable Solutions in
Threshold Network Flow Games With Bounded Treewidth |
Aldo Pacchiano, Yoram Bachrach |
Hybrid BiLSTM-Siamese Network
for Relation Extraction |
Zeyuan Cui, Shijun Liu |
Efficient City-Scale
Patrolling Using Decomposition and Grafting |
Wanyuan Wang, Zichen Dong, Bo An, Yichuan Jiang |
Risk Averse Reinforcement
Learning for Mixed Multi-agent Environments |
Sai Koti Reddy Danda, Amrita Saha, Srikanth Tamilselvam,
Priyanka Agrawal, Pankaj Dayama |
Evidence Propagation and
Consensus Formation in Noisy Environments |
Michael Crosscombe, Jonathan Lawry |
Emergence of
Scenario-Appropriate Collaborative Behaviors for Teams of Robotic Bodyguards |
Hassam Sheikh, Ladislau Bölöni |
The Imitation Game: Learned
reciprocity in Markov games |
Tom Eccles, Edward Hughes, Steven Wheelwright, Joel Leibo,
János Kramár |
From Hotelling to Load
Balancing: Approximation and the Principle of Minimum Differentiation |
Matthias Feldotto, Pascal Lenzner, Louise Molitor, Alexander
Skopalik |
Online Motion Concept
Learning: A novel algorithm for sample-efficient learning and recognition of
human actions |
Miguel Vasco, Francisco Melo, David Martins de Matos, Ana
Paiva, Tetsunari Inamura |
Automatic Feature Engineering
by Deep Reinforcement Learning |
Jianyu Zhang, Jianye Hao, Françoise Fogelman-Soulié, Zan
Wang |
Rethinking the Neutrality
Axiom in Judgment Aggregation |
Zoi Terzopoulou, Ulle Endriss |
Explaining Failures
Propagations in the Execution of Multi-Agent Temporal Plans |
Gianluca Torta, Roberto Micalizio, Samuele Sormano |
Logically-Constrained Neural
Fitted Q-iteration |
Mohammadhosein Hasanbeig, Alessandro Abate, Daniel
Kroening |
A Homophily-Free Community
Detection Framework for Trajectories with Delayed Responses |
Chung-Kyun Han, Shih-Fen Cheng, Pradeep Varakantham |
Stability of Human-Inspired
Agent Societies |
Joe Collenette, Katie Atkinson, Daan Bloembergen, Karl
Tuyls |
Deep Generative and
Discriminative Domain Adaptation |
Han Zhao |
Exploration in the face of
Parametric and Intrinsic Uncertainties |
Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong
Kong |
Predictive Execution
Monitoring of BDI Recipes |
Mika Barkan, Gal Kaminka |
Priority driven Local
Optimization for Crowd Simulation |
Himangshu Saikia, Fangkai Yang, Christopher Peters |
Aggregating Citizen
Preferences for Public Projects Through Civic Crowdfunding |
Sankarshan Damle, Moin Hussain Moti, Praphul Chandra, Sujit
Gujar |
Adaptive multi-agent system
for situated task allocation |
Quentin Baert, Anne-Cécile Caron, Maxime Morge,
Jean-Christophe Routier |
The Gift Exchange Game: Managing Opponent Actions |
Steven Damer, Maria Gini, Jeffrey Rosenschein |
DeepAggregation: A New
Approach for Aggregating Incomplete Ranked Lists using Multi-Layer Graph
Embedding |
Rohith D Vallam, Ramasuri Narayanam, Srikanth Tamilselvam,
Nicholas Mattei, Sudhanshu Singh, Shweta Garg, Gyana Parija |
A Social Choice Perspective
on Database Aggregation |
Francesco Belardinelli, Umberto Grandi |
A Privacy Preserving
Multiagent System for Load Balancing in the Smart Grid |
Shangyu Xie, Yuan Hong, Peng-Jun Wan |
Collaborative Reinforcement
Learning Model for Sustainability of Cooperation in Sequential Social
Dilemmas |
Ritwik Chaudhuri, Kushal Mukherjee, Rohith D Vallam, Ayush
Kumar, Antriksh Mathur, Shweta Garg, Sudhanshu Singh, Gyana Parija, Ramasuri
Narayanam |
A Truthful Online Mechanism
for Allocating Fog Computing Resources |
Fan Bi, Sebastian Stein, Enrico Gerding, Nick Jennings, Tom La
Porta |
Reinforcement Learning with
Derivative-Free Exploration |
Xionghui Chen, Yang Yu |
Strategic Majoritarian Voting
with Propositional Goals |
Arianna Novaro, Umberto Grandi, Dominique Longin, Emiliano
Lorini |
Teaching Social Behavior
through Human Reinforcement for Ad hoc Teamwork: The STAR Framework |
Shani Alkoby, Avilash Rath, Peter Stone |
Classification of Contractual
Conflicts via Learning of Semantic Representations |
João Paulo Aires, Roger Granada, Juarez Monteiro, Rodrigo
Coelho Barros, Felipe Meneguzzi |
Deep Fictitious Play for
Games with Continuous Action Spaces |
Nitin Kamra, Umang Gupta, Kai Wang, Fei Fang, Yan Liu, Milind
Tambe |
Power indices for team
reformation planning under uncertainty |
Jonathan Cohen, Abdel-Illah Mouaddib |
Verifying Strategic Abilities
in Multi-agent Systems with Private Data-Sharing |
Francesco Belardinelli, Ioana Boureanu, Catalin Dima, Vadim
Malvone |
Masquerade Attack Detection
Through Observation Planning for Multi-Robot Systems |
Kacper Wardega, Wenchao Li, Roberto Tron |
Meta-learning of Bidding
Agent with Knowledge Gradient in a Fully Agent-based Sponsored Search Auction
Simulator |
Donghun Lee, Warren Powell |
Curriculum Learning for
Tightly Coupled Multiagent Systems |
Golden Rockefeller, Patrick Mannion, Kagan Tumer |
A Compression-Inspired
Framework for Macro Discovery |
Francisco Garcia, Bruno da Silva, Philip Thomas |
A Meta-MDP Approach to
Exploration for Lifelong Reinforcement Learning |
Francisco Garcia, Philip Thomas |
X*: Anytime Multiagent
Planning With Bounded Search |
Kyle Vedder, Joydeep Biswas |
Report-Sensitive
Spot-checking in Peer Grading |
Hedayat Zarkoob, Kevin Leyton-Brown, Hu Fu |
Training Cooperative Agents
for Multi-Agent Reinforcement Learning |
Sushrut Bhalla, Sriram Ganapathi Subramanian, Mark
Crowley |
Toward Robust Policy
Summarization |
Isaac Lage, Daphna Lifschitz, Finale Doshi-Velez, Ofra
Amir |
Manipulative Design of
Scoring Systems |
Dorothea Baumeister, Tobias Hogrebe |
Removing the Target Network
from Deep Q-Networks with Mellowmax Operator |
Seungchan Kim, Kavosh Asadi, Michael Littman, George
Konidaris |
Modeling Human
Decision-Making during Hurricanes: From Model to Data Collection to
Prediction |
Nutchanon Yongsatianchot, Stacy Marsella |
Preference Learning in
Automated Negotiation Using Gaussian Uncertainty Models |
Haralambie Leahu, Michael Kaisers, Tim Baarslag |
Social Power in Human-Robot
Interaction: Towards more Persuasive Robots |
Mojgan Hashemian, Ana Paiva, Samuel Mascarenhas, Pedro Santos,
Rui Prada |
Designing Emergent Swarm
Behaviors using Behavior Trees and Grammatical Evolution |
Aadesh Neupane, Michael A. Goodrich |
Multi-Agent Learning and
Coordination with Clustered Deep Q-Network |
Simon Pageaud, Véronique Deslandres, Vassilissa Lehoux, Salima
Hassas |
Applying Norms and Sanctions
to Promote Cybersecurity Hygiene |
Shubham Goyal, Nirav Ajmeri, Munindar Singh |
Robust Monitoring on Graphs
with an Application to Suicide Prevention in Social Networks |
Aida Rahmattalabi, Phebe Vayanos, Anthony Fulginiti, Milind
Tambe |
Learn a Robust Policy in
Adversarial Games via Playing with an Expert Opponent |
Jialian Li, Tongzheng Ren, Hang Su, Jun Zhu |
Smart Targets to Avoid
Observation in CTO problem |
Leonardo Ferreira da Costa, Thayanne França da Silva, José
Luis Alves Leite, Raimundo Juracy Campos Ferro Junior, Raphael Pinheiro de
Souza, João Pedro Bernardino Andrade, Gustavo Augusto Lima de Campos |
The Rise and Fall of Complex
Family Structures: Coalition Formation, Stability, and Power Struggle |
Angelina Brilliantova, Anton Pletenev, Hadi Hosseini |
Optimal Risk in Multiagent
Blind Tournaments |
Theodore Perkins |
To be Big Picture Thinker or
Detail-Oriented? Utilizing Perceived Gist Information to Achieve Efficient
Convention Emergence with Bilateralism and Multilateralism |
Shuyue Hu, Chin-wing Leung, Ho-fung Leung, Jiamou Liu |
The DARPA SocialSim
Challenge: Massive Multi-Agent Simulations of the Github Ecosystem |
Jim Blythe, Emilio Ferrara, Diana Huang, Kristina Lerman,
Goran Muric, Anna Sapienza, Alexey Tregubov, Diogo Pacheco, John
Bollenbacher, Alessandro Flammini, Pik-Mai Hui, Fil Menczer |
Object Exchangability in
Reinforcement Learning |
John Mern, Dorsa Sadigh, Mykel Kochenderfer |
Escape Room: A Configurable
Testbed for Hierarchical Reinforcement Learning |
Jacob Menashe, Peter Stone |
A Selective Exploration
Method for Policy Transfer in Reinforcement Learning |
Akshay Narayan, Tze Yun Leong |
Two-stage n-person prisoner's
dilemma with social preferences |
Seji Takanashi, Makoto Yokoo |
Bribery in Balanced Knockout
Tournaments |
Christine Konicki, Virginia Vassilevska Williams |
Fairness Through the Lens of
Proportional Equality |
Arpita Biswas, Suvam Mukherjee |
Recognising and Explaining
Bidding Strategies in Negotiation Support Systems |
Vincent Koeman, Koen Hindriks, Jonathan Gratch, Catholijn
Jonker |
Domain Adaptation for
Reinforcement Learning on the Atari |
Thomas Carr, Maria Chli, George Vogiatzis |