Publications of Sarath Chandar

Sarath Chandar

Activity

Principal investigator: Feb. 2020 - present

Preprints

Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)
Nizar Islah, Istabrak Abbes, Irina Rish, Sarath Chandar and Eilif B. Muller
In ArXiv, 2026.
#NLP
[arXiv]
Dialectics of Alignment: Harnessing Unsafe Knowledge for Dynamic Safety Routing
Maryam Hashemzadeh, Jerry Huang, Minseon Kim, Marc-Alexandre Côté and Sarath Chandar
In ArXiv, 2026.
#DL, #NLP
[arXiv]
Probabilistic Calibration Is a Trainable Capability in Language Models
Davide Baldelli, Sruthi Kuriakose, Maryam Hashemzadeh, Amal Zouaq and Sarath Chandar
In ArXiv, 2026.
#NLP
[arXiv]
Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models
Nilaksh*, Saurav Jha*, Artem Zholus* and Sarath Chandar
In ArXiv, 2026.
#DL, #RL
[arXiv]
CoPeP: Benchmarking Continual Pretraining for Protein Language Models
Darshan Patil, Pranshu Malviya, Mathieu Reymond, Quentin Fournier and Sarath Chandar
In ArXiv, 2026.
#NLP
[arXiv]
LLMs Can't Play Hangman: On the Necessity of a Private Working Memory for Language Agents
Davide Baldelli, Ali Parviz, Amal Zouaq and Sarath Chandar
In ArXiv, 2026.
#NLP
[arXiv]
Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models
Gabriele Prato, Shagun Sodhani, Alessandro Sordoni and Sarath Chandar
In ArXiv, 2025.
#NLP
[arXiv]
Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling
Saurav Jha, M. Jehanzeb Mirza, Wei Lin, Shiqi Yang and Sarath Chandar
In ArXiv, 2025.
#DL
[arXiv]
Neural Coherence: Find higher performance to out-of-distribution tasks from few samples
Simon Guiroy, Mats Richter, Sarath Chandar and Christopher Pal
In ArXiv, 2025.
#DL
[arXiv]
Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Hadi Nekoei, Aman Jaiswal, Patrice Bechard, Oleh Shliazhko, Orlando Marquez Ayala, Mathieu Reymond, Massimo Caccia, Alexandre Drouin, Sarath Chandar and Alexandre Lacoste
In ArXiv, 2025.
#NLP, #RL
[arXiv]
GRPO-λ: Credit Assignment improves LLM Reasoning
Prasanna Parthasarathi*, Mathieu Reymond*, Boxing Chen, Yufei Cui and Sarath Chandar
In ArXiv, 2025.
#NLP, #RL
[arXiv]
CrystalGym: A New Benchmark for Materials Discovery Using Reinforcement Learning
Prashant Govindarajan, Mathieu Reymond, Antoine Clavaud, Mariano Phielipp, Santiago Miret and Sarath Chandar
In ArXiv, 2025.
#RL, #Other
[arXiv], [code]
NovoMolGen: Rethinking Molecular Language Model Pretraining
Kamran Chitsaz*, Roshan Balaji*, Quentin Fournier, Nirav Pravinbhai Bhatt and Sarath Chandar
In ArXiv, 2025.
#NLP, #Other
[arXiv], [huggingface], [code]
Optimizers Qualitatively Alter Solutions And We Should Leverage This
Razvan Pascanu, Clare Lyle, Ionut-Vlad Modoranu, Naima Elosegui Borras, Dan Alistarh, Petar Velickovic, Sarath Chandar, Soham De and James Martens
In ArXiv, 2025.
#DL
[arXiv]
Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models
Milan Bhan, Jean-Noel Vittaut, Nicolas Chesneau, Sarath Chandar and Marie-Jeanne Lesot
In ArXiv, 2025.
#NLP
[arXiv]
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
Mido Assran*, Adrien Bardes*, David Fan*, Quentin Garrido*, Russell Howes*, Mojtaba Komeili*, Matthew Muckley*, Ammar Rizvi*, Claire Roberts*, Koustuv Sinha*, Artem Zholus*, Sergio Arnaud*, Abha Gejji*, Ada Martin*, Francois Robert Hogan*, Daniel Dugas*, Piotr Bojanowski, Vasil Khalidov, Patrick Labatut, Francisco Massa, Marc Szafraniec, Kapil Krishnakumar, Yong Li, Xiaodong Ma, Sarath Chandar, Franziska Meier*, Yann LeCun*, Michael Rabbat* and Nicolas Ballas*
Technical Report, 2025.
#DL
[website], [arXiv], [code], [huggingface], [blogpost]
Torque-Aware Momentum
Pranshu Malviya, Gonçalo Mordido, Aristide Baratin, Reza Babanezhad Harikandeh, Gintare Karolina Dziugaite, Razvan Pascanu and Sarath Chandar
In ArXiv, 2024.
#DL
[arXiv]
Too Big to Fool: Resisting Deception in Language Models
Mohammad Reza Samsami, Mats Leon Richter, Juan Rodriguez, Megh Thakkar, Sarath Chandar and Maxime Gasse
In ArXiv, 2024.
#NLP
[arXiv]
Interpretability Needs a New Paradigm
Andreas Madsen, Himabindu Lakkaraju, Siva Reddy and Sarath Chandar
In ArXiv, 2024.
#NLP, #DL
[arXiv]
Protein Language Models: Is Scaling Necessary?
Quentin Fournier, Robert M. Vernon, Almer van der Sloot, Benjamin Schulz, Sarath Chandar and Christopher James Langmead
In bioRxiv, 2024.
#NLP, #Other
[bioRxiv], [code], [huggingface]
Segmentation of Multiple Sclerosis Lesions across Hospitals: Learn Continually or Train from Scratch?
Enamundram Naga Karthik, Anne Kerbrat, Pierre Labauge, Tobias Granberg, Jason Talbott, Daniel S. Reich, Massimo Filippi, Rohit Bakshi, Virginie Callot, Sarath Chandar and Julien Cohen-Adad
In ArXiv, 2022.
[Medical Imaging meets NeurIPS, 2022]
#DL, #Other
[arXiv], [code]
An Introduction to Lifelong Supervised Learning
Shagun Sodhani, Mojtaba Faramarzi, Sanket Vaibhav Mehta, Pranshu Malviya, Mohamed Abdelsalam, Janarthanan Rajendran and Sarath Chandar
In ArXiv, 2022.
#DL
[arXiv]
Maximum Reward Formulation In Reinforcement Learning
Sai Krishna Gottipati, Yashaswi Pathak, Rohan Nuttall, Raviteja Chunduru, Ahmed Touati, Sriram Ganapathi Subramanian, Matthew E Taylor and Sarath Chandar
In ArXiv, 2020.
#RL
[arXiv]

Conference and journal articles

2026

Squeezing More from the Stream: Learning Representation Online for Streaming Reinforcement Learning
Nilaksh*, Antoine Clavaud*, Mathieu Reymond, François Rivest and Sarath Chandar
International Conference on Machine Learning (ICML), 2026.
#RL, #DL
[openreview], [arXiv], [code]
TAPNext++: What's Next for Tracking Any Point (TAP)?
Sebastian Jung*, Artem Zholus*, Martin Sundermeyer, Carl Doersch, Ross Goroshin, David Joseph Tan, Sarath Chandar, Rudolph Triebel and Federico Tombari
Findings of the IEEE CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
#DL
[arXiv], [website], [code]
The Expressive Limits of Diagonal SSMs for State-Tracking
Mehran Shakerinava, Behnoush Khavari, Siamak Ravanbakhsh and Sarath Chandar
International Conference on Learning Representations (ICLR), 2026.
#DL
[openreview], [arXiv]
The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning
Milad Aghajohari, Kamran Chitsaz, Amirhossein Kazemnejad, Sarath Chandar, Alessandro Sordoni, Aaron Courville and Siva Reddy
International Conference on Learning Representations (ICLR), 2026.
#RL, #NLP
[openreview], [arXiv], [code]
Investigating the Multilingual Calibration Effects of Language Model Instruction-Tuning
Jerry Huang, Peng Lu, Qiuhao Zeng, Yusuke Iwasawa, Yutaka Matsuo, Sarath Chandar, Edison Marrese-Taylor and Irene Li
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2026.
#NLP, #DL
[acl], [arXiv], [code]
Monitoring morphometric drift in lifelong learning segmentation of the spinal cord
Enamundram Naga Karthik, Sandrine Bédard, Jan Valošek, Christoph S. Aigner, Elise Bannier, Josef Bednařík, Virginie Callot, Anna Combes, Armin Curt, Gergely David, Falk Eippert, Lynn Farner, Michael G Fehlings, Patrick Freund, Tobias Granberg, Cristina Granziera, RHSCIR Network Imaging Group, Ulrike Horn, Tomáš Horák, Suzanne Humphreys, Markus Hupp, Anne Kerbrat, Nawal Kinany, Shannon Kolind, Petr Kudlička, Anna Lebret, Lisa Eunyoung Lee, Caterina Mainero, Allan R. Martin, Megan McGrath, Govind Nair, Kristin P. O'Grady, Jiwon Oh, Russell Ouellette, Nikolai Pfender, Dario Pfyffer, Pierre-François Pradat, Alexandre Prat, Emanuele Pravatà, Daniel S. Reich, Ilaria Ricchi, Naama Rotem-Kohavi, Simon Schading-Sassenhausen, Maryam Seif, Andrew Smith, Seth A Smith, Grace Sweeney, Roger Tam, Anthony Traboulsee, Constantina Andrada Treaba, Charidimos Tsagkas, Zachary Vavasour, Dimitri Van De Ville, Kenneth Arnold Weber II, Sarath Chandar and Julien Cohen-Adad
Imaging Neuroscience, 2026.
#DL, #Other
[mit], [arXiv]
CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design
Prashant Govindarajan*, Davide Baldelli*, Jay Pathak, Quentin Fournier and Sarath Chandar
Transactions on Machine Learning Research, 2026.
#NLP
[openreview], [arXiv], [code], [huggingface]

2025

Shielded Controller Units for RL with Operational Constraints Applied to Remote Microgrids
Hadi Nekoei, Alexandre Blondin Massé, Rachid Hassani, Sarath Chandar and Vincent Mai
Reinforcement Learning Conference (RLC), 2025.
#RL
[arXiv], [code]
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus, Carl Doersch, Yi Yang, Skanda Koppula, Viorica Pătrăucean, Xu Owen He, Ignacio Rocco, Mehdi S. M. Sajjadi, Sarath Chandar and Ross Goroshin
International Conference on Computer Vision (ICCV), 2025.
#DL, #Other
[website], [arXiv], [code], [huggingface], [YouTube]
Steering Large Language Model Activations in Sparse Spaces
Reza Bayat*, Ali Rahimi-Kalahroudi*, Mohammad Pezeshki, Sarath Chandar and Pascal Vincent
Conference on Language Modeling (COLM), 2025.
#NLP, #DL
[openreview], [arXiv]
Boosting LLM Reasoning via Spontaneous Self-Correction
Xutong Zhao, Tengyu Xu, Xuewei Wang, Zhengxing Chen, Di Jin, Liang Tan, Yen-Ting, Zishun Yu, Zhuokai Zhao, Yun He, Sinong Wang, Han Fang, Sarath Chandar and Chen Zhu
Conference on Language Modeling (COLM), 2025.
#NLP, #RL
[openreview], [arXiv]
Manifold Metric: A Loss Landscape Approach for Predicting Model Performance
Pranshu Malviya, Jerry Huang, Aristide Baratin, Quentin Fournier and Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2025.
#DL
[pmlr], [arXiv]
Revisiting Replay and Gradient Alignment for Continual Pre-Training of Large Language Models
Istabrak Abbes, Gopeshh Subbaraj, Matthew Riemer, Nizar Islah, Tsuguchika Tabaru, Hiroaki Kingetsu, Sarath Chandar and Irina Rish
Conference on Lifelong Learning Agents (CoLLAs), 2025.
#NLP, #DL
[pmlr], [arXiv], [code]
Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs
Megh Thakkar, Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das and Sarath Chandar
Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
#NLP
[acl], [arXiv]
Small Encoders Can Rival Large Decoders in Detecting Groundedness
Istabrak Abbes, Gabriele Prato, Quentin Fournier, Fernando Rodriguez, Alaa Boukhary, Adam Elwood and Sarath Chandar
Findings of the Association for Computational Linguistics (ACL), 2025.
#NLP
[acl], [arXiv]
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Boxing Chen and Sarath Chandar
Findings of the Association for Computational Linguistics (ACL), 2025.
#NLP
[acl], [arXiv]
NeoBERT: A Next Generation BERT
Lola Le Breton, Quentin Fournier, Mariam El Mezouar and Sarath Chandar
Transactions on Machine Learning Research (TMLR), 2025.
[ICLR Journal-to-Conference Track, 2026]
#NLP
[openreview], [arXiv], [code], [huggingface]
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar*, Hadi Nekoei*, Mathieu Reymond, Miao Liu, Janarthanan Rajendran and Sarath Chandar
International Conference on Learning Representations (ICLR), 2025.
#RL
[website], [openreview], [arXiv], [code]
BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Artem Zholus, Maksim Kuznetsov, Roman Schutski, Rim Shayakhmetov, Daniil Polykovskiy, Sarath Chandar and Alex Zhavoronkov
AAAI Conference on Artificial Intelligence (AAAI), 2025. [Best poster award]
#DL, #RL
[website], [aaai], [arXiv], [code], [YouTube]

2024

Balancing Context Length and Mixing Times for Reinforcement Learning at Scale
Matthew Riemer, Khimya Khetarpal, Janarthanan Rajendran and Sarath Chandar
Conference on Neural Information Processing Systems (NeurIPS), 2024.
#RL
[neurips], [openreview]
Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Kamran Chitsaz, Quentin Fournier, Gonçalo Mordido and Sarath Chandar
Findings of the Association for Computational Linguistics (EMNLP), 2024.
#NLP, #DL
[acl], [arXiv]
Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh and Sarath Chandar
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
#NLP
[acl], [arXiv]
Do Large Language Models Know How Much They Know?
Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani and Sarath Chandar
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
#NLP
[acl], [arXiv]
Sharpness-Aware Minimization Scaled by Outlier Normalization for Robust DNNs on In-Memory Computing Accelerators
Sébastien Henwood, Gonçalo Mordido, Yvon Savaria, Sarath Chandar and François Leduc-Primeau
Asilomar Conference on Signals, Systems, and Computers, 2024.
[Conference on Lifelong Learning Agents (CoLLAs) Workshop Track, 2022]
[Edge Intelligence Workshop (EIW), 2022]
#DL
[paper], [arXiv]
Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
Rached Bouchoucha, Ahmed Haj Yahmed, Darshan Patil, Janarthanan Rajendran, Amin Nikanjam, Sarath Chandar and Foutse Khomh
International Conference on Software Maintenance and Evolution (ICSME), 2024.
#RL
[arXiv]
Should We Attend More or Less? Modulating Attention for Fairness
Abdelrahman Zayed, Gonçalo Mordido, Samira Shabanian and Sarath Chandar
Conference on Language Modeling (COLM), 2024.
#NLP
[openreview], [arXiv]
Are self-explanations from Large Language Models faithful?
Andreas Madsen, Sarath Chandar and Siva Reddy
Findings of the Association for Computational Linguistics (ACL), 2024.
#NLP
[acl], [arXiv], [code], [YouTube]
A deep-dive into the tradeoffs of preference alignment with PEFT
Megh Thakkar, Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das and Sarath Chandar
Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
#NLP
[acl], [arXiv]
Why Don't Prompt-Based Fairness Metrics Correlate?
Abdelrahman Zayed, Gonçalo Mordido, Ioana Baldini and Sarath Chandar
Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
#NLP
[acl], [arXiv], [code], [YouTube]
Sub-goal Distillation: A Method to Improve Small Language Agents
Maryam Hashemzadeh, Elias Stengel-Eskin, Sarath Chandar and Marc-Alexandre Côté
Conference on Lifelong Learning Agents (CoLLAs), 2024. [Oral presentation.]
#RL, #NLP
[arXiv], [code]
Lookbehind-SAM: k steps back, 1 step forward
Gonçalo Mordido, Pranshu Malviya, Aristide Baratin and Sarath Chandar
International Conference on Machine Learning (ICML), 2024.
#DL
[pmlr], [arXiv], [code], [YouTube]
Faithfulness Measurable Masked Language Models
Andreas Madsen, Siva Reddy and Sarath Chandar
International Conference on Machine Learning (ICML), 2024. [Spotlight award - top 3.5%]
#NLP
[pmlr], [arXiv], [code], [YouTube], [blogpost]
Contrast-agnostic Spinal Cord Segmentation: A Comparative Study of ConvNets and Vision Transformers
Enamundram Naga Karthik, Sandrine Bedard, Jan Valosek, Sarath Chandar and Julien Cohen-Adad
Medical Imaging with Deep Learning (MIDL), 2024.
#DL, #Other
[openreview]
Promoting Exploration in Memory-Augmented Adam using Critical Momenta
Pranshu Malviya, Gonçalo Mordido, Aristide Baratin, Reza Babanezhad Harikandeh, Jerry Huang, Simon Lacoste-Julien, Razvan Pascanu and Sarath Chandar
Transactions on Machine Learning Research (TMLR), 2024.
#DL
[openreview], [arXiv]
A Responsible Framework for Applying Artificial Intelligence on Medical Images and Signals at the Point-of-care: the PACS-AI Platform
Pascal Theriault-Lauzier, Denis Cobin, Olivier Tastet, Elodie Labrecque Langlais, Bahareh Taji, Guson Kang, Aun-Yeong Chong, Derek So, An Tang, Judy Wawira Gichoya, Sarath Chandar, Pierre-Luc Déziel, Julie G Hussin, Samuel Kadoury and Robert Avram
Canadian Journal of Cardiology, 2024.
#DL, #Other
[paper]
MVP: Minimal Viable Phrase for Long Text Understanding
Louis Clouâtre, Amal Zouaq and Sarath Chandar
Joint International Conference on Computational Linguistics, Language, Resources and Evaluation (LREC-COLING), 2024.
#NLP
[acl]
Mastering Memory Tasks with World Models
Mohammad Reza Samsami*, Artem Zholus*, Janarthanan Rajendran and Sarath Chandar
International Conference on Learning Representations (ICLR), 2024. [Oral presentation.]
#RL, #DL
[openreview], [arXiv], [code]
Intelligent Switching for Reset-Free RL
Darshan Patil, Janarthanan Rajendran, Glen Berseth and Sarath Chandar
International Conference on Learning Representations (ICLR), 2024.
#RL
[openreview], [arXiv], [code]
On the Costs and Benefits of Adopting Lifelong Learning for Software Analytics - Empirical Study on Brown Build and Risk Prediction
Doriane Olewicki, Sarra Habchi, Mathieu Nayrolles, Mojtaba Faramarzi, Sarath Chandar and Bram Adams
International Conference on Software Engineering (ICSE) - Software Engineering in Practice Track, 2024. [ICSE24 SEIP Distinguished Paper Award]
#DL
[arXiv]
Fairness-Aware Structured Pruning in Transformers
Abdelrahman Zayed, Gonçalo Mordido, Samira Shabanian, Ioana Baldini and Sarath Chandar
AAAI Conference on Artificial Intelligence (AAAI), 2024.
#NLP
[aaai], [arXiv], [code], [YouTube]
Learning Conditional Policies for Crystal Design Using Offline Reinforcement Learning
Prashant Govindarajan, Santiago Miret, Jarrid Rector-Brooks, Mariano Phielipp, Janarthanan Rajendran and Sarath Chandar
Digital Discovery Journal, 2024.
#RL
[paper]

2023

Self-Influence Guided Data Reweighting for Language Model Pre-training
Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar and Partha Talukdar
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
#NLP
[acl], [openreview], [arXiv]
EpiK-Eval: Evaluation for Language Models as Epistemic Models
Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani and Sarath Chandar
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
#NLP
[acl], [openreview], [arXiv], [code]
Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models
Amirhossein Kazemnejad, Mehdi Rezagholizadeh, Prasanna Parthasarathi and Sarath Chandar
Findings of the Association for Computational Linguistics (EMNLP), 2023.
#NLP
[acl], [openreview], [arXiv]
Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning
Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen and Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2023.
[Deep Reinforcement Learning Workshop @ NeurIPS, 2022]
#RL
[pmlr], [arXiv]
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Hadi Nekoei, Xutong Zhao, Janarthanan Rajendran, Miao Liu and Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2023.
#RL
[pmlr], [arXiv]
Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Hadi Nekoei, Akilesh Badrinaaraayanan, Amit Sinha, Mohammad Amini, Janarthanan Rajendran, Aditya Mahajan and Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2023.
#RL
[pmlr], [arXiv]
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar and Janarthanan Rajendran
Conference on Uncertainty in Artificial Intelligence (UAI), 2023.
#RL
[pmlr], [arXiv]
An Empirical Investigation of the Role of Pre-training in Lifelong Learning
Sanket Vaibhav Mehta, Darshan Patil, Sarath Chandar and Emma Strubell
Journal of Machine Learning Research, 2023.
#DL
[jmlr], [arXiv]
Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness
Abdelrahman Zayed, Prasanna Parthasarathi, Gonçalo Mordido, Hamid Palangi, Samira Shabanian and Sarath Chandar
AAAI Conference on Artificial Intelligence (AAAI), 2023.
#NLP
[aaai], [arXiv], [YouTube]

2022

Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes
Louis Clouâtre, Prasanna Parthasarathi, Amal Zouaq and Sarath Chandar
Findings of the Association for Computational Linguistics (EMNLP), 2022.
#NLP
[acl]
Local Structure Matters Most in Most Languages
Louis Clouâtre, Prasanna Parthasarathi, Amal Zouaq and Sarath Chandar
Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing (AACL-IJCNLP), 2022.
#NLP
[acl]
TAG: Task-based Accumulated Gradients for Lifelong Learning
Pranshu Malviya, Balaraman Ravindran and Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2022.
[Theory and Foundation of Continual Learning @ ICML, 2021]
#DL
[pmlr], [arXiv], [code]
Improving Meta-Learning Generalization with Activation-Based Early-Stopping
Simon Guiroy, Christopher Pal, Gonçalo Mordido and Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2022.
#DL
[pmlr], [arXiv], [code], [YouTube]
Combining Reinforcement Learning and Constraint Programming for Sequence-Generation Tasks with Hard Constraints
Daphné Lafleur, Sarath Chandar and Gilles Pesant
International Conference on Principles and Practice of Constraint Programming (CP), 2022.
#RL
[paper]
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
Yi Wan*, Ali Rahimi-Kalahroudi*, Janarthanan Rajendran, Ida Momennejad, Sarath Chandar and Harm van Seijen
International Conference on Machine Learning (ICML), 2022.
#RL
[pmlr], [arXiv], [code]
Post-hoc Interpretability for Neural NLP: A Survey
Andreas Madsen, Siva Reddy and Sarath Chandar
ACM Computing Surveys, 2022.
#NLP
[acm], [arXiv]
Local Structure Matters Most: Perturbation Study in NLU
Louis Clouâtre, Prasanna Parthasarathi, Amal Zouaq and Sarath Chandar
Findings of the Association for Computational Linguistics (ACL), 2022.
#NLP
[acl], [arXiv]
Memory Augmented Optimizers for Deep Learning
Paul-Aymeric McRae, Prasanna Parthasarathi, Mido Assran and Sarath Chandar
International Conference on Learning Representations (ICLR), 2022.
#DL
[openreview], [arXiv], [code]
PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks
Mojtaba Faramarzi, Mohammad Amini, Akilesh Badrinaaraayanan, Vikas Verma and Sarath Chandar
AAAI Conference on Artificial Intelligence (AAAI), 2022.
#DL
[aaai], [arXiv], [code]

2021

A Survey of Data Augmentation Approaches for NLP
Steven Y. Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura and Eduard Hovy
Findings of the Association for Computational Linguistics (ACL-IJCNLP), 2021.
#NLP
[acl], [arXiv]
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis Clouâtre, Philippe Trempe, Amal Zouaq and Sarath Chandar
Findings of the Association for Computational Linguistics (ACL-IJCNLP), 2021.
#NLP
[acl], [arXiv]
Continuous Coordination As a Realistic Scenario for Lifelong Learning
Hadi Nekoei, Akilesh Badrinaaraayanan, Aaron Courville and Sarath Chandar
International Conference on Machine Learning (ICML), 2021.
#RL
[pmlr], [arXiv], [code]
A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Prasanna Parthasarathi, Mohamed Abdelsalam, Joelle Pineau and Sarath Chandar
Proceedings of the 22nd Annual SIGdial Meeting on Discourse and Dialogue, 2021.
#NLP
[acl]
Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Prasanna Parthasarathi, Sarath Chandar and Joelle Pineau
Proceedings of the 22nd Annual SIGdial Meeting on Discourse and Dialogue, 2021.
#NLP
[acl]
IIRC: Incremental Implicitly-Refined Classification
Mohamed Abdelsalam, Mojtaba Faramarzi, Shagun Sodhani and Sarath Chandar
IEEE CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
#DL
[website], [paper], [arXiv], [code], [PyPI], [docs]
Towered Actor Critic for Handling Multiple Action Types in Reinforcement Learning For Drug Discovery
Sai Krishna Gottipati, Yashaswi Pathak, Boris Sattarov, Sahir, Rohan Nuttall, Mohammad Amini, Matthew E. Taylor and Sarath Chandar
AAAI Conference on Artificial Intelligence (AAAI), 2021.
#RL, #Other
[aaai]

2020

The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Harm van Seijen, Hadi Nekoei, Evan Racah and Sarath Chandar
Conference on Neural Information Processing Systems (NeurIPS), 2020.
#RL
[neurips], [arXiv], [code]
Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning
Sai Krishna Gottipati*, Boris Sattarov*, Sufeng Niu, Yashaswi Pathak, Haoran Wei, Shengchao Liu, Karam MJ Thomas, Simon Blackburn, Connor W Coley, Jian Tang, Sarath Chandar and Yoshua Bengio
International Conference on Machine Learning (ICML), 2020.
#RL
[pmlr], [arXiv]