Alumni Member

Activity

  • Master's student: Oct. 2023 - Mar. 2025

Master's Theses

  1. Towards Efficient and Effective Preference Alignment for Large Language Models
    by , with Sarath Chandar as supervisor.
    Université de Montréal ⸺ 2024.
    [thesis]

Preprints

Conference and Journal Articles

2025

  1. How to Train Your LLM Web Agent: A Statistical Diagnosis
    Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, , , Thibault Le Sellier de Chezelles, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Liu, Alexandre Drouin, Laurent Charlin, Alexandre Piché, Alexandre Lacoste and Massimo Caccia
    Conference on Neural Information Processing Systems (NeurIPS), 2025.
    #NLP, #RL
    [arXiv]

  2. Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs
    , Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das and
    Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
    #NLP
    [acl], [arXiv]

  3. ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
    Ahmed Masry*, , Aayush Bajaj, Aaryaman Kartha, Enamul Hoque and Shafiq Joty
    International Conference on Computational Linguistics (COLING) Industry Track, 2025.
    #NLP
    [acl], [arXiv], [code]

2024

  1. WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
    Leo Boisvert*, , Maxime Gasse, Massimo Caccia, Thibault Le Sellier De Chezelles, Quentin Cappart, Nicolas Chapados, Alexandre Lacoste and Alexandre Drouin
    Conference on Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024.
    #NLP
    [neurips], [openreview], [arXiv], [code]

  2. A deep-dive into the tradeoffs of preference alignment with PEFT
    , Quentin Fournier, Matthew Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das and
    Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
    #NLP
    [acl], [arXiv]

2023

  1. Self-Influence Guided Data Reweighting for Language Model Pre-training
    , Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth and Partha Talukdar
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
    #NLP
    [acl], [openreview], [arXiv]