Preprints

  • The Markovian Thinker
    Milad Aghajohari*, , Amirhossein Kazemnejad*, , Alessandro Sordoni, Aaron Courville and Siva Reddy
    arXiv preprint, 2025.
    #NLP, #RL
    [arXiv]

  • Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
    , Aman Jaiswal, Patrice Bechard, Oleh Shliazhko, Orlando Marquez Ayala, , Massimo Caccia, Alexandre Drouin, and Alexandre Lacoste
    arXiv preprint, 2025.
    #NLP, #RL
    [arXiv]

  • GRPO-λ: Credit Assignment improves LLM Reasoning
    Prasanna Parthasarathi*, , Boxing Chen, Yufei Cui and
    arXiv preprint, 2025.
    #RL, #NLP
    [arXiv]

  • NovoMolGen: Rethinking Molecular Language Model Pretraining
    , , Quentin Fournier, Nirav Pravinbhai Bhatt and
    arXiv preprint, 2025.
    #NLP, #Other
    [arXiv]

  • CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design
    , , Jay Pathak, Quentin Fournier and
    arXiv preprint, 2025.
    #NLP
    [arXiv]

  • Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models
    , Jean-Noël Vittaut, Nicolas Chesneau, and Marie-Jeanne Lesot
    arXiv preprint, 2025.
    #NLP
    [arXiv]

  • Structure-Aligned Protein Language Model
    Can Chen, , Robert M. Vernon, Christopher James Langmead, Yoshua Bengio and Quentin Fournier
    arXiv preprint, 2025.
    #NLP, #Other
    [arXiv]

  • Monitoring morphometric drift in lifelong learning segmentation of the spinal cord
    Enamundram Naga Karthik, Sandrine Bédard, Jan Valošek, Christoph S. Aigner, Elise Bannier, Josef Bednařík, Virginie Callot, Anna Combes, Armin Curt, Gergely David, Falk Eippert, Lynn Farner, Michael G Fehlings, Patrick Freund, Tobias Granberg, Cristina Granziera, RHSCIR Network Imaging Group, Ulrike Horn, Tomáš Horák, Suzanne Humphreys, Markus Hupp, Anne Kerbrat, Nawal Kinany, Shannon Kolind, Petr Kudlička, Anna Lebret, Lisa Eunyoung Lee, Caterina Mainero, Allan R. Martin, Megan McGrath, Govind Nair, Kristin P. O'Grady, Jiwon Oh, Russell Ouellette, Nikolai Pfender, Dario Pfyffer, Pierre-François Pradat, Alexandre Prat, Emanuele Pravatà, Daniel S. Reich, Ilaria Ricchi, Naama Rotem-Kohavi, Simon Schading-Sassenhausen, Maryam Seif, Andrew Smith, Seth A Smith, Grace Sweeney, Roger Tam, Anthony Traboulsee, Constantina Andrada Treaba, Charidimos Tsagkas, Zachary Vavasour, Dimitri Van De Ville, Kenneth Arnold Weber II, and Julien Cohen-Adad
    arXiv preprint, 2025.
    #NLP
    [arXiv]

  • Too Big to Fool: Resisting Deception in Language Models
    , Mats Leon Richter, Juan Rodriguez, , and Maxime Gasse
    arXiv preprint, 2024.
    #NLP
    [arXiv]

  • Interpretability Needs a New Paradigm
    , Himabindu Lakkaraju, Siva Reddy and
    arXiv preprint, 2024.
    #NLP, #DL
    [arXiv]

Conference and Journal Articles

2025

  1. How to Train Your LLM Web Agent: A Statistical Diagnosis
    Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, , , Thibault Le Sellier de Chezelles, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Liu, Alexandre Drouin, Laurent Charlin, Alexandre Piché, Alexandre Lacoste and Massimo Caccia
    Conference on Neural Information Processing Systems (NeurIPS), 2025.
    #NLP, #RL
    [arXiv]

  2. Rendering-Aware Reinforcement Learning for Vector Graphics Generation
    Juan A. Rodriguez, Haotian Zhang, Abhay Puri, Aarash Feizi, Rishav Pramanik, Pascal Wichmann, Arnab Mondal, , Rabiul Awal, Perouz Taslakian, Spandana Gella, Sai Rajeswar, David Vazquez, Christopher Pal and Marco Pedersoli
    Conference on Neural Information Processing Systems (NeurIPS), 2025.
    #NLP, #RL
    [arXiv]

  3. Steering Large Language Model Activations in Sparse Spaces
    Reza Bayat*, , Mohammad Pezeshki, and Pascal Vincent
    Conference on Language Modeling (COLM), 2025.
    #NLP, #DL
    [arXiv]

  4. Boosting LLM Reasoning via Spontaneous Self-Correction
    , Tengyu Xu, Xuewei Wang, Zhengxing Chen, Di Jin, Liang Tan, Yen-Ting, Zishun Yu, Zhuokai Zhao, Yun He, Sinong Wang, Han Fang, and Chen Zhu
    Conference on Language Modeling (COLM), 2025.
    #NLP, #RL
    [openreview], [arXiv]

  5. Do Biased Models Have Biased Thoughts?
    Swati Rajwal, Shivank Garg, Reem Abdel-Salam and
    Conference on Language Modeling (COLM), 2025.
    #NLP
    [openreview], [arXiv]

  6. Revisiting Replay and Gradient Alignment for Continual Pre-Training of Large Language Models
    , Gopeshh Subbaraj, Matthew Riemer, Nizar Islah, Tsuguchika Tabaru, Hiroaki Kingetsu, and Irina Rish
    Conference on Lifelong Learning Agents (CoLLAs), 2025.
    #NLP, #DL
    [arXiv]

  7. Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs
    , Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das and
    Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
    #NLP
    [acl], [arXiv]

  8. Small Encoders Can Rival Large Decoders in Detecting Groundedness
    , , Quentin Fournier, Fernando Rodriguez, Alaa Boukhary, Adam Elwood and
    Findings of the Association for Computational Linguistics (ACL), 2025.
    #NLP
    [acl], [arXiv]

  9. Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
    , Prasanna Parthasarathi, Mehdi Rezagholizadeh, Boxing Chen and
    Findings of the Association for Computational Linguistics (ACL), 2025.
    #NLP
    [acl], [arXiv]

  10. IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
    Shrestha Mohanty, Negar Arabzadeh, Andrea Tupini, Yuxuan Sun, Alexey Skrynnik, , Marc-Alexandre Côté and Julia Kiseleva
    ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025.
    #NLP
    [arXiv]

  11. Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data
    , Anian Ruoss, Joel Veness and Tim Genewein
    International Conference on Machine Learning (ICML), 2025.
    #NLP, #RL
    [openreview], [arXiv]

  12. NeoBERT: A Next Generation BERT
    , Quentin Fournier, Mariam El Mezouar and
    Transactions on Machine Learning Research (TMLR), 2025.
    #NLP
    [openreview]

  13. ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
    Ahmed Masry*, , Aayush Bajaj, Aaryaman Kartha, Enamul Hoque and Shafiq Joty
    International Conference on Computational Linguistics (COLING) Industry Track, 2025.
    #NLP
    [acl], [arXiv], [code]

2024

  1. WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
    Leo Boisvert*, , Maxime Gasse, Massimo Caccia, Thibault Le Sellier De Chezelles, Quentin Cappart, Nicolas Chapados, Alexandre Lacoste and Alexandre Drouin
    Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
    #NLP
    [neurips], [openreview], [arXiv], [code]

  2. Exploring Quantization for Efficient Pre-Training of Transformer Language Models
    , Quentin Fournier, and
    Findings of the Association for Computational Linguistics (EMNLP), 2024.
    #NLP, #DL
    [acl], [arXiv]

  3. Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
    , Prasanna Parthasarathi, Mehdi Rezagholizadeh and
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
    #NLP
    [acl], [arXiv]

  4. Do Large Language Models Know How Much They Know?
    , , Prasanna Parthasarathi, Shagun Sodhani and
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
    #NLP
    [acl], [arXiv]

  5. Should We Attend More or Less? Modulating Attention for Fairness
    , , Samira Shabanian and
    Conference on Language Modeling (COLM), 2024.
    #NLP
    [openreview], [arXiv]

  6. Are self-explanations from Large Language Models faithful?
    , and Siva Reddy
    Findings of the Association for Computational Linguistics (ACL), 2024.
    #NLP
    [acl], [arXiv], [code], [YouTube]

  7. A deep-dive into the tradeoffs of preference alignment with PEFT
    , Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das and
    Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
    #NLP
    [acl], [arXiv]

  8. Why Don’t Prompt-Based Fairness Metrics Correlate?
    , , Ioana Baldini and
    Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
    #NLP
    [acl], [arXiv], [YouTube]

  9. Sub-goal Distillation: A Method to Improve Small Language Agents
    , Elias Stengel-Eskin, and Marc-Alexandre Côté
    Conference on Lifelong Learning Agents (CoLLAs), 2024. [Oral presentation]
    #RL, #NLP
    [arXiv]

  10. Faithfulness Measurable Masked Language Models
    , Siva Reddy and
    International Conference on Machine Learning (ICML), 2024. [Spotlight award - top 3.5%]
    #NLP
    [pmlr], [arXiv], [code], [YouTube], [blogpost]

  11. MVP: Minimal Viable Phrase for Long Text Understanding
    , Amal Zouaq and
    Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024.
    #NLP
    [acl]

  12. Fairness-Aware Structured Pruning in Transformers
    , , Samira Shabanian, Ioana Baldini and
    AAAI Conference on Artificial Intelligence (AAAI), 2024.
    #NLP
    [aaai], [arXiv], [YouTube]

2023

  1. Self-Influence Guided Data Reweighting for Language Model Pre-training
    , Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, and Partha Talukdar
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
    #NLP
    [acl], [openreview], [arXiv]

  2. EpiK-Eval: Evaluation for Language Models as Epistemic Models
    , , Prasanna Parthasarathi, Shagun Sodhani and
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
    #NLP
    [acl], [openreview], [arXiv], [code]

  3. Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models
    Amirhossein Kazemnejad, Mehdi Rezagholizadeh, Prasanna Parthasarathi and
    Findings of the Association for Computational Linguistics (EMNLP), 2023.
    #NLP
    [acl], [openreview], [arXiv]

  4. Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness
    , Prasanna Parthasarathi, , Hamid Palangi, Samira Shabanian and
    AAAI Conference on Artificial Intelligence (AAAI), 2023.
    #NLP
    [aaai], [arXiv], [YouTube]

2022

  1. Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining
    , Nicholas Meade, Vaibhav Adlakha and Siva Reddy
    Findings of the Association for Computational Linguistics (EMNLP), 2022.
    [BlackboxNLP, 2022]
    #NLP
    [acl], [arXiv], [code]

  2. Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes
    , Prasanna Parthasarathi, Amal Zouaq and
    Findings of the Association for Computational Linguistics (EMNLP), 2022.
    #NLP
    [acl]

  3. Local Structure Matters Most in Most Languages
    , Prasanna Parthasarathi, Amal Zouaq and
    Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing (AACL-IJCNLP), 2022.
    #NLP
    [acl]

  4. Post-hoc Interpretability for Neural NLP: A Survey
    , Siva Reddy and
    ACM Computing Surveys, 2022.
    #NLP
    [acm], [arXiv]

  5. Local Structure Matters Most: Perturbation Study in NLU
    , Prasanna Parthasarathi, Amal Zouaq and
    Findings of the Association for Computational Linguistics (ACL), 2022.
    #NLP
    [acl], [arXiv]

2021

  1. Benchmarking Bias Mitigation Algorithms in Representation Learning through Fairness Metrics
    , Deepak Sharma, Soroush Mehri, Adriana Romero, Samira Shabanian and Sina Honari
    Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2021.
    #NLP
    [neurips], [openreview], [code]

  2. A Survey of Data Augmentation Approaches for NLP
    Steven Y. Feng, Varun Gangal, Jason Wei, , Soroush Vosoughi, Teruko Mitamura and Eduard Hovy
    Findings of the Association for Computational Linguistics (ACL-IJCNLP), 2021.
    #NLP
    [acl], [arXiv]

  3. MLMLM: Link Prediction with Mean Likelihood Masked Language Model
    , Philippe Trempe, Amal Zouaq and
    Findings of the Association for Computational Linguistics (ACL-IJCNLP), 2021.
    #NLP
    [acl], [arXiv]

  4. A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
    Prasanna Parthasarathi, , Joelle Pineau and
    Proceedings of the 22nd Annual SIGdial Meeting on Discourse and Dialogue, 2021.
    #NLP
    [acl]

  5. Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task?
    Prasanna Parthasarathi, and Joelle Pineau
    Proceedings of the 22nd Annual SIGdial Meeting on Discourse and Dialogue, 2021.
    #NLP
    [acl]