I am a masterβs student in Software Engineering at Beihang University, advised by Prof. Qian Yu. My research is centered on visual code generation: building models that generate structured, editable visual code rather than rasterized outputs. I work at the intersection of multimodal large language models, vector graphics, and generative modeling, with two primary research threads. The first is SVG code generation, where I study how models can understand, reason over, and synthesize vector graphics programs, with representative works including Render-in-the-Loop and LLM4SVG. The second is vector animation code generation, where I build models that generate temporally coherent, structure-preserving vector animations directly in code, represented by VAnim and GroupSketch.
More broadly, I am interested in turning code-centric multimodal models into practical systems for creation and interaction. In industry, I work on generative ranking and retrieval with multimodal large language models at Tencent Yuanbao, and on code generation and post-training for code agents within the iQuest foundation large language model team at Ubiquant Investment. These experiences further strengthen my goal of pushing AI beyond visual understanding toward controllable, executable, and production-ready visual code generation systems.
π₯ News
- 2026.05: Β ππ Our paper VAnim has been accepted by ICML 2026!
- 2025.07: Β ππ Our paper GroupSketch has been accepted by ACM MM 2025!
- 2025.02: Β ππ Our paper LLM4SVG has been accepted by CVPR 2025!
π Publications

VAnim: Rendering-Aware Sparse State Modeling for Structure-Preserving Vector Animation
Guotao Liang, Zhangcheng Wang, Chuang Wang, Juncheng Hu, Haitao Zhou, Junhua Liu, Jing Zhang, Dong Xu, Qian Yu
TL;DR: VAnim formulates SVG animation as sparse state updates on a persistent DOM tree, combining identification-first motion planning with rendering-aware RL to generate structure-preserving vector animations from text.
International Conference on Machine Learning (ICML), 2026.

Multi-Object Sketch Animation with Grouping and Motion Trajectory Priors
Guotao Liang, Juncheng Hu, Ximing Xing, Jing Zhang, Qian Yu
TL;DR: GroupSketch synthesizes multi-object sketch animations with grouping and motion trajectory priors, enabling users to create complex animations with ease.
ACM International Conference on Multimedia, 2025.

Empowering LLMs to Understand and Generate Complex Vector Graphics
Ximing Xing, Juncheng Hu, Guotao Liang, Jing Zhang, Dong Xu, Qian Yu
TL;DR: LLM4SVG introduces learnable SVG Semantic Tokens and a large SVGX-SFT dataset, enabling LLMs to understand and generate complex vector graphics.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
π Project | π Code | π€ SVGX-SFT-1M Dataset
π Educations
- 2024.09 β Present: M.S. in Software Engineering, School of Software, Beihang University
π» Internships
-
2026.03 β Present, Ubiquant Investment, Algorithm Intern, iQuest Foundation Large Language Model Team β Beijing, China
Working on code generation and post-training for code agents within the iQuest foundation large language model team. -
2025.10 β 2026.02, Tencent Yuanbao, Algorithm Intern, Yuanbao Search Department β Beijing, China
Working on generative ranking and retrieval with multimodal large language models. -
2025.05 β 2025.09, 4Paradigm, Research Intern β Beijing, China
Conducted research on multimodal large language models for vector graphics animation.