Publications by Milan Bhan
-
Milan Bhan
Research Areas: Interpretability, Natural Language Processing, Large Language Models
Activity
- Intern: Jun 2025 - now
Preprints
-
Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models
Milan Bhan, Jean-Noel Vittaut, Nicolas Chesneau, Sarath Chandar, and Marie-Jeanne Lesot
In ArXiv, 2025.
#NLP
[arXiv]