Publications by Andreas Madsen
Activity
- PhD student: Aug. 2020 - Nov. 2024
Doctoral thesis
-
New Faithfulness-Centric Interpretability Paradigms for Natural Language Processing
by Andreas Madsen, supervised by Siva Reddy and Sarath Chandar.
Polytechnique Montreal, November 2024.
[thesis]
Preprints
-
Interpretability Needs a New Paradigm
Andreas Madsen, Himabindu Lakkaraju, Siva Reddy and Sarath Chandar
arXiv, 2024.
#NLP, #DL, #Other
[arXiv]
Conference and journal papers
2024
-
Are self-explanations from Large Language Models faithful?
Andreas Madsen, Sarath Chandar and Siva Reddy
Findings of the Association for Computational Linguistics (ACL), 2024.
#NLP
[acl], [arXiv], [code], [YouTube]
-
Faithfulness Measurable Masked Language Models
Andreas Madsen, Siva Reddy and Sarath Chandar
International Conference on Machine Learning (ICML), 2024. [Spotlight award - top 3.5%]
#NLP
[pmlr], [arXiv], [code], [YouTube], [blogpost]
2022
-
Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining
Andreas Madsen, Nicholas Meade, Vaibhav Adlakha and Siva Reddy
Findings of the Association for Computational Linguistics: EMNLP, 2022.
[BlackboxNLP Workshop, 2022]
#NLP
[acl], [arXiv], [code]
-
Post-hoc Interpretability for Neural NLP: A Survey
Andreas Madsen, Siva Reddy and Sarath Chandar
ACM Computing Surveys, 2022.
#NLP
[acm], [arXiv]