Preprints

  • BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
    , Maksim Kuznetsov, Roman Schutski, Rim Shayakhmetov, Daniil Polykovskiy, , and Alex Zhavoronkov
    In arXiv, 2024.
    #DL, #RL
    [arXiv], [website]

  • Maximum Reward Formulation In Reinforcement Learning
    Sai Krishna Gottipati, Yashaswi Pathak, Rohan Nuttall, Raviteja Chunduru, Ahmed Touati, Sriram Ganapathi Subramanian, Matthew E Taylor, and
    In arXiv, 2020.
    #RL
    [arXiv]

Conference and Journal Papers

2024

  1. Balancing Context Length and Mixing Times for Reinforcement Learning at Scale
    Matthew Riemer, Khimya Khetarpal, , and
    Neural Information Processing Systems (NeurIPS), 2024.
    #RL

  2. Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
    Rached Bouchoucha, Ahmed Haj Yahmed, , , Amin Nikanjam, , and Foutse Khomh
    International Conference on Software Maintenance and Evolution (ICSME), 2024.
    #RL

  3. Sub-goal Distillation: A Method to Improve Small Language Agents
    , Elias Stengel-Eskin, , and Marc-Alexandre Cote
    Conference on Lifelong Learning Agents (CoLLAs), 2024. [Oral presentation.]
    #RL, #NLP
    [arXiv]

  4. Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
    Safa Alver, , and Doina Precup
    Conference on Lifelong Learning Agents (CoLLAs), 2024.
    #RL
    [arXiv]

  5. Mastering Memory Tasks with World Models
    , , , and
    International Conference on Learning Representations (ICLR), 2024. [Oral presentation.]
    #RL, #DL
    [openreview]

  6. Intelligent Switching for Reset-Free RL
    , , Glen Berseth, and
    International Conference on Learning Representations (ICLR), 2024.
    #RL
    [openreview]

  7. Learning Conditional Policies for Crystal Design Using Offline Reinforcement Learning
    , Santiago Miret, , Mariano Phielipp, , and
    Digital Discovery Journal, 2024.
    #RL
    [openreview]

2023

  1. Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning
    , , Ida Momennejad, Harm van Seijen, and
    Conference on Lifelong Learning Agents (CoLLAs), 2023.
    [Deep Reinforcement Learning Workshop, NeurIPS, 2022]
    #RL
    [arXiv]

  2. Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
    , , , Miao Liu, and
    Conference on Lifelong Learning Agents (CoLLAs), 2023.
    #RL
    [paper]

  3. Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
    , , Amit Sinha, Mohammad Amini, , Aditya Mahajan, and
    Conference on Lifelong Learning Agents (CoLLAs), 2023.
    #RL
    [arXiv]

  4. Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
    , Yangchen Pan, Chenjun Xiao, , and
    Conference on Uncertainty in Artificial Intelligence (UAI), 2023.
    #RL
    [arXiv]

  5. Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads
    Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, , Liam Paull, and Antoine Lesage-Landry
    International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023.
    #RL
    [arXiv]

2022

  1. Combining Reinforcement Learning and Constraint Programming for Sequence-Generation Tasks with Hard Constraints
    , , and Gilles Pesant
    Principles and Practice of Constraint Programming (CP), 2022.
    #RL

  2. Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
    , , , Ida Momennejad, , and Harm van Seijen
    International Conference on Machine Learning (ICML), 2022.
    #RL
    [arXiv], [code]

2021

  1. Continuous Coordination As a Realistic Scenario for Lifelong Learning
    , , Aaron Courville, and
    International Conference on Machine Learning (ICML), 2021.
    #RL
    [arXiv], [code]

  2. Towered Actor Critic for Handling Multiple Action Types in Reinforcement Learning For Drug Discovery
    Sai Krishna Gottipati, Yashaswi Pathak, Boris Sattarov, Sahir, Rohan Nuttall, Mohammad Amini, Matthew E. Taylor, and
    AAAI Conference on Artificial Intelligence (AAAI), 2021.
    #RL

2020

  1. The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
    Harm van Seijen, , , and
    Neural Information Processing Systems (NeurIPS), 2020.
    #RL
    [arXiv], [code]

  2. Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning
    Sai Krishna Gottipati*, Boris Sattarov*, Sufeng Niu, Yashaswi Pathak, Haoran Wei, Shengchao Liu, Karam MJ Thomas, Simon Blackburn, Connor W Coley, Jian Tang, , and Yoshua Bengio
    International Conference on Machine Learning (ICML), 2020.
    #RL
    [arXiv]