Publications

Aerobatic Helicopter
Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM
Kunal Menda*, Jean de Becdelièvre*, Jayesh K. Gupta*, Ilan Kroo, Mykel Kochenderfer and Zachary Manchester
ICML 2020
Dynamic Multi-Robot Task Allocation
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints
Shushman Choudhary, Jayesh K. Gupta, Mykel Kochenderfer, Dorsa Sadigh and Jeannette Bohg
RSS 2020
structured mechanical models
Structured Mechanical Models for Robot Learning and Control
Jayesh K. Gupta*, Kunal Menda*, Zachary Manchester and Mykel Kochenderfer
L4DC 2020
model primitive hierarchical reinforcement learning
Model Primitives for Hierarchical Lifelong Reinforcement Learning
Bohan Wu, Jayesh K. Gupta and Mykel Kochenderfer
JAAMAS 2020
normalizing flow model for policy representation in continuous action multi-agent systems
Normalizing Flow Model for Policy Representation in Continuous Action Multi-agent Systems
Xiaobai Ma, Jayesh K. Gupta and Mykel Kochenderfer
AAMAS 2020
Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning
Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning
Raunak P. Bhattacharyya, Derek J. Phillips, Changliu Liu, Jayesh K. Gupta, Katherine Driggs-Campbell, Mykel J. Kochenderfer
ICRA 2019
model primitive hierarchical reinforcement learning
Model Primitive Hierarchical Lifelong Reinforcement Learning
Bohan Wu, Jayesh K. Gupta and Mykel Kochenderfer
AAMAS 2019
speaker listener env description
Learning Policy Representations in Multiagent Systems
Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yuri Burda and Harrison Edwards
ICML 2018
nodes connected in a graph
Evaluating Generalization in Multiagent Systems using Agent-Interaction Graphs
Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yuri Burda and Harrison Edwards
AAMAS 2018
multiple agents carrying a box forward
Cooperative Multi-agent Control using Deep Reinforcement Learning
Jayesh K. Gupta, Maxim Egorov and Mykel Kochenderfer
AAMAS 2017
POMDPs.jl: A Framework for Sequential Decision Making under Uncertainty
Maxim Egorov, Zachary N. Sunberg, Edward Balaban, Tim A Wheeler, Jayesh K. Gupta, Mykel J. Kochenderfer
Journal of Machine Learning Research
highway image
Model-Free Imitation Learning with Policy Optimization
Jonathan Ho, Jayesh K. Gupta and Stefano Ermon
ICML 2016
planit image
PlanIt: A Crowdsourcing Approach for Learning to Plan Paths from Large Scale Preference Feedback
Ashesh Jain, Debarghya Das, Jayesh K. Gupta and Ashutosh Saxena
ICRA 2015
show more
ANN vs SNN
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
John Mern, Jayesh K. Gupta and Mykel Kochenderfer
IEEE Symposium Series on Computational Intelligence (SSCI) 2017
A general framework for structured learning of mechanical systems
Jayesh K. Gupta*, Kunal Menda*, Zachary Manchester and Mykel Kochenderfer
Preprint
Health-informed Policy Gradients for Multi-agent Reinforcement Learning
Ross E. Allen, Javona White Bear, Jayesh K. Gupta, Mykel Kochenderfer
Preprint