Membre des anciens élèves

Activité

  • Étudiant au doctorat: août 2020 - nov. 2024

Thèse de doctorat

  1. New Faithfulness-Centric Interpretability Paradigms for Natural Language Processing
    par , avec Siva Reddy et Sarath Chandar comme superviseurs.
    Polytechnique Montreal ⸺ novembre 2024.
    [thesis]

Prépublications

Articles de conférence et de revue

2024

  1. Are self-explanations from Large Language Models faithful?
    , et Siva Reddy
    Findings of the Association for Computational Linguistics (ACL), 2024.
    #NLP
    [acl], [arXiv], [code], [YouTube]

  2. Faithfulness Measurable Masked Language Models
    , Siva Reddy et
    International Conference on Machine Learning (ICML), 2024. [Spotlight award - top 3.5%]
    #NLP
    [pmlr], [arXiv], [code], [YouTube], [blogpost]

2022

  1. Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining
    , Nicholas Meade, Vaibhav Adlakha et Siva Reddy
    Findings of the Association for Computational Linguistics (EMNLP), 2022.
    [BlackboxNLP Workshop, 2022]
    #NLP
    [acl], [arXiv], [code]

  2. Post-hoc Interpretability for Neural NLP: A Survey
    , Siva Reddy et
    ACM Computing Surveys, 2022.
    #NLP
    [acm], [arXiv]