Computer-aided design (CAD) is the digital construction of 2D and 3D objects, and is central to a wide range of engineering and manufacturing applications like automobile and aviation. Despite its importance, CAD modeling remains largely a time-intensive, manual task. Recent works have attempted to automate this process with small transformer-based models and handcrafted CAD sequence representations. However, there has been little effort to leverage the potential of large language models (LLMs) for sequential CAD design.
To address these limitations, we introduce CADmium, a novel approach that reformulates CAD generation as a purely text-to-text task. Our contributions are as follows:
GPT-4.1 that processes multi-view images of 3D objects and their construction sequences to generate annotations combining natural language fluency with geometric precision. We applied this pipeline to 176,017 CAD sequences to release a new dataset with improved, expert-quality annotations.
Qwen-2.5-Coder, on our dataset to generate JSON-formatted CAD sequences from natural language prompts. This leverages the pre-trained model's capabilities without specialized embedding layers. We evaluated our approach against leading models using established metrics and validated it on the Fusion360 Reconstruction dataset and the expert-annotated CADPrompt benchmark.
@article{govindarajan2025cadmium,
title={CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design},
author={Govindarajan, Prashant and Baldelli, Davide and Pathak, Jay and Fournier, Quentin and Chandar, Sarath},
journal={arXiv preprint arXiv:2507.09792},
year={2025}
}