Jerry Huang

    Research Areas: Deep Learning, Natural Language Processing
    (Fast Tracked from Masters)

Activity

  • PhD Student: Oct 2023 - now

Conference and Journal Papers

2025

  1. Mamba Modulation: On the Length Generalization of Mamba
    Peng Lu*, , Qiuhao Zeng, Xinyu Wang, Boxing Wang, Philippe Langlais, and Yufei Cui
    Conference on Neural Information Processing Systems (NeurIPS), 2025.
    #NLP, #RL
    [arXiv]

  2. Resona: Improving Context Copying in Linear Recurrence Models with Retrieval
    , Linrui Ma*, Xinyu Wang*, Peng Lu, Prasanna Parthasarathi, Xiao-Wen Chang, Boxing Chen, and Yufei Cui
    Conference on Language Modeling (COLM), 2025.
    #NLP, #DL
    [openreview], [arXiv]

  3. Manifold Metric: A Loss Landscape Approach for Predicting Model Performance
    , , Aristide Baratin, Quentin Fournier, and
    Conference on Lifelong Learning Agents (CoLLAs), 2025.
    #DL
    [arXiv]

  4. Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
    , Prasanna Parthasarathi, Mehdi Rezagholizadeh, Boxing Chen, and
    Findings of the Association for Computational Linguistics (ACL), 2025.
    #NLP
    [acl], [arXiv]

  5. Calibrated Language Models and How to Find Them with Label Smoothing
    , Peng Lu*, and Qiuhao Zeng
    International Conference on Machine Learning (ICML), 2025.
    #NLP
    [openreview], [arXiv]

2024

  1. Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models
    , Prasanna Parthasarathi, Mehdi Rezagholizadeh, and
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
    #NLP
    [acl], [arXiv]

  2. Do Large Language Models Know How Much They Know?
    , , Prasanna Parthasarathi, Shagun Sodhani, and
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
    #NLP
    [acl], [arXiv]

  3. Promoting Exploration in Memory-Augmented Adam using Critical Momenta
    , , Aristide Baratin, Reza Babanezhad Harikandeh, , Simon Lacoste-Julien, Razvan Pascanu, and
    Transactions on Machine Learning Research (TMLR), 2024.
    #DL
    [openreview], [arXiv]

2023

  1. EpiK-Eval: Evaluation for Language Models as Epistemic Models
    , , Prasanna Parthasarathi, Shagun Sodhani, and
    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
    #NLP
    [acl], [openreview], [arXiv], [code]