Publications d'Mathieu Reymond

Membre des anciens élèves

Activité

CoPeP: Benchmarking Continual Pretraining for Protein Language Models
Darshan Patil, Pranshu Malviya, Mathieu Reymond, Quentin Fournier et Sarath Chandar
In ArXiv, 2026.
#NLP
[arXiv]
Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Hadi Nekoei, Aman Jaiswal, Patrice Bechard, Oleh Shliazhko, Orlando Marquez Ayala, Mathieu Reymond, Massimo Caccia, Alexandre Drouin, Sarath Chandar et Alexandre Lacoste
In ArXiv, 2025.
#NLP, #RL
[arXiv]
GRPO-λ: Credit Assignment improves LLM Reasoning
Prasanna Parthasarathi*, Mathieu Reymond*, Boxing Chen, Yufei Cui et Sarath Chandar
In ArXiv, 2025.
#NLP, #RL
[arXiv]
CrystalGym: A New Benchmark for Materials Discovery Using Reinforcement Learning
Prashant Govindarajan, Mathieu Reymond, Antoine Clavaud, Mariano Phielipp, Santiago Miret et Sarath Chandar
In ArXiv, 2025.
#RL, #Other
[arXiv], [code]

Squeezing More from the Stream: Learning Representation Online for Streaming Reinforcement Learning
Nilaksh*, Antoine Clavaud*, Mathieu Reymond, François Rivest et Sarath Chandar
International Conference on Machine Learning (ICML), 2026.
#RL, #DL
[openreview], [arXiv], [code]

A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar*, Hadi Nekoei*, Mathieu Reymond, Miao Liu, Janarthanan Rajendran et Sarath Chandar
International Conference on Learning Representations (ICLR), 2025.
#RL
[website], [openreview], [arXiv], [code]