Publications d'Artem Zholus
-
Artem Zholus
Domaines de recherche: Traitement du langage naturel, Apprentissage par renforcement
Activité
- Étudiant au doctorat: fév. 2023 - maintenant
Prépublications
-
Hierarchical Planning with Latent World Models
Wancong Zhang, Basile Terver, Artem Zholus, Soham Chitnis, Harsh Sutaria, Mido Assran, Randall Balestriero, Amir Bar, Adrien Bardes, Yann LeCun et Nicolas Ballas
In ArXiv, 2026.
#DL, #RL
[arXiv], [website], [code] -
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
Mido Assran*, Adrien Bardes*, David Fan*, Quentin Garrido*, Russell Howes*, Mojtaba Komeili*, Matthew Muckley*, Ammar Rizvi*, Claire Roberts*, Koustuv Sinha*, Artem Zholus*, Sergio Arnaud*, Abha Gejji*, Ada Martin*, Francois Robert Hogan*, Daniel Dugas*, Piotr Bojanowski, Vasil Khalidov, Patrick Labatut, Francisco Massa, Marc Szafraniec, Kapil Krishnakumar, Yong Li, Xiaodong Ma, Sarath Chandar, Franziska Meier*, Yann LeCun*, Michael Rabbat* et Nicolas Ballas*
Technical Report, 2025.
#DL
[website], [arXiv], [code], [huggingface], [blogpost] -
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Karolis Jucys, George Adamopoulos, Mehrab Hamidi, Stephanie Milani, Mohammad Reza Samsami, Artem Zholus, Sonia Joseph, Blake Richards, Irina Rish et Özgür Şimşek
Workshop on Mechanistic Interpretability @ ICML, 2024.
#DL
[arXiv]
Articles de conférence et de revue
2026
-
TAPNext++: What's Next for Tracking Any Point (TAP)?
Sebastian Jung*, Artem Zholus*, Martin Sundermeyer, Carl Doersch, Ross Goroshin, David Joseph Tan, Sarath Chandar, Rudolph Triebel et Federico Tombari
Findings of the IEEE CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
#DL
[arXiv], [website], [code] -
Unraveling the Complexity of Memory in RL Agents: An Approach for Classification and Evaluation
Egor Cherepanov, Nikita Kachaev, Artem Zholus, Alexey K. Kovalev et Aleksandr I. Panov
International Conference on Learning Representations (ICLR), 2026.
#RL
[openreview], [arXiv]
2025
-
TRecViT: A Recurrent Video Transformer
Viorica Pătrăucean, Xu Owen He, Joseph Heyward, Chuhan Zhang, Mehdi S. M. Sajjadi, George-Cristian Muraru, Artem Zholus, Mahdi Karami, Ross Goroshin, Yutian Chen, Simon Osindero, João Carreira et Razvan Pascanu
Transactions on Machine Learning Research (TMLR), 2025.
#DL
[openreview], [arXiv], [code] -
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus, Carl Doersch, Yi Yang, Skanda Koppula, Viorica Pătrăucean, Xu Owen He, Ignacio Rocco, Mehdi S. M. Sajjadi, Sarath Chandar et Ross Goroshin
International Conference on Computer Vision (ICCV), 2025.
#DL, #Other
[website], [arXiv], [code], [huggingface], [YouTube] -
IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Shrestha Mohanty, Negar Arabzadeh, Andrea Tupini, Yuxuan Sun, Alexey Skrynnik, Artem Zholus, Marc-Alexandre Côté et Julia Kiseleva
ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025.
#NLP
[arXiv] -
BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Artem Zholus, Maksim Kuznetsov, Roman Schutski, Rim Shayakhmetov, Daniil Polykovskiy, Sarath Chandar et Alex Zhavoronkov
AAAI Conference on Artificial Intelligence (AAAI), 2025. [Best poster award]
#DL, #RL
[website], [arXiv], [code], [YouTube]
2024
-
Mastering Memory Tasks with World Models
Mohammad Reza Samsami*, Artem Zholus*, Janarthanan Rajendran et Sarath Chandar
International Conference on Learning Representations (ICLR), 2024. [Oral presentation.]
#RL, #DL
[openreview], [arXiv], [code]