Activity

  • Intern: Apr 2023 - now

Preprints

Conference and Journal Papers

2026

  1. The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning
    Milad Aghajohari, , Amirhossein Kazemnejad, , Alessandro Sordoni, Aaron Courville, and Siva Reddy
    International Conference on Learning Representations (ICLR), 2026.
    #RL, #NLP
    [openreview], [arXiv], [code]

2024

  1. Exploring Quantization for Efficient Pre-Training of Transformer Language Models
    , , , and
    Findings of the Association for Computational Linguistics (EMNLP), 2024.
    #NLP, #DL
    [acl], [arXiv]

2023

  1. Training DNNs Resilient to Adversarial and Random Bit-Flips by Learning Quantization Ranges
    , , Jean Pierre David, and François Leduc-Primeau
    Transactions on Machine Learning Research (TMLR), 2023.
    #DL
    [openreview], [code]