Publications d'Hadi Nekoei

Hadi Nekoei

Domaines de recherche: Apprentissage par renforcement, Apprentissage continu et permanent

Activité

Mem-π: Adaptive Memory through Learning When and What to Generate
Xiaoqiang Wang, Chao Wang, Hadi Nekoei, Christopher Pal, Alexandre Lacoste, Spandana Gella, Bang Liu et Perouz Taslakian
In ArXiv, 2026.
#RL, #NLP
[arXiv]
Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Hadi Nekoei, Aman Jaiswal, Patrice Bechard, Oleh Shliazhko, Orlando Marquez Ayala, Mathieu Reymond, Massimo Caccia, Alexandre Drouin, Sarath Chandar et Alexandre Lacoste
In ArXiv, 2025.
#NLP, #RL
[arXiv]
Balancing Profit and Fairness in Risk-Based Pricing Markets
Jesse Thibodeau, Hadi Nekoei, Afaf Taïk, Janarthanan Rajendran et Golnoosh Farnadi
In ArXiv, 2025.
#RL
[arXiv]

How to Train Your LLM Web Agent: A Statistical Diagnosis
Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, Hadi Nekoei, Megh Thakkar, Thibault Le Sellier de Chezelles, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Liu, Alexandre Drouin, Laurent Charlin, Alexandre Piché, Alexandre Lacoste et Massimo Caccia
Conference on Neural Information Processing Systems (NeurIPS), 2025.
#NLP, #RL
[openreview], [arXiv]
Shielded Controller Units for RL with Operational Constraints Applied to Remote Microgrids
Hadi Nekoei, Alexandre Blondin Massé, Rachid Hassani, Sarath Chandar et Vincent Mai
Reinforcement Learning Conference (RLC), 2025.
#RL
[arXiv], [code]
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar*, Hadi Nekoei*, Mathieu Reymond, Miao Liu, Janarthanan Rajendran et Sarath Chandar
International Conference on Learning Representations (ICLR), 2025.
#RL
[website], [openreview], [arXiv], [code]

Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads
Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, Hadi Nekoei, Liam Paull et Antoine Lesage-Landry
Machine Learning, 2023.
[International Conference on Autonomous Agents and Multiagent Systems (AAMAS) Extended Abstracts, 2023]
#RL
[springer], [acm], [arXiv]
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Hadi Nekoei, Xutong Zhao, Janarthanan Rajendran, Miao Liu et Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2023.
#RL
[pmlr], [arXiv]
Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Hadi Nekoei, Akilesh Badrinaaraayanan, Amit Sinha, Mohammad Amini, Janarthanan Rajendran, Aditya Mahajan et Sarath Chandar
Conference on Lifelong Learning Agents (CoLLAs), 2023.
#RL
[pmlr], [arXiv]
DEUP: Direct Epistemic Uncertainty Prediction
Moksh Jain, Salem Lahlou, Hadi Nekoei, Victor Butoi, Paul Bertin, Jarrid Rector-Brooks, Maksym Korablyov et Yoshua Bengio
Transactions on Machine Learning Research (TMLR), 2023.
#DL
[openreview], [arXiv], [code]

Continuous Coordination As a Realistic Scenario for Lifelong Learning
Hadi Nekoei, Akilesh Badrinaaraayanan, Aaron Courville et Sarath Chandar
International Conference on Machine Learning (ICML), 2021.
#RL
[pmlr], [arXiv], [code]

The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Harm van Seijen, Hadi Nekoei, Evan Racah et Sarath Chandar
Conference on Neural Information Processing Systems (NeurIPS), 2020.
#RL
[neurips], [arXiv], [code]