I am a master’s student in Software Engineering at Beihang University, advised by Prof. Qian Yu. My research is centered on visual code generation: building models that generate structured, editable visual code rather than rasterized outputs. I work at the intersection of multimodal large language models, vector graphics, and generative modeling, with two primary research threads. The first is SVG code generation, where I study how models can understand, reason over, and synthesize vector graphics programs, with representative works including Render-in-the-Loop and LLM4SVG. The second is vector animation code generation, where I build models that generate temporally coherent, structure-preserving vector animations directly in code, represented by VAnim and GroupSketch.

More broadly, I am interested in turning code-centric multimodal models into practical systems for creation and interaction. In industry, I work on generative ranking and retrieval with multimodal large language models at Tencent Yuanbao, and on code generation and post-training for code agents within the iQuest foundation large language model team at Ubiquant Investment. These experiences further strengthen my goal of pushing AI beyond visual understanding toward controllable, executable, and production-ready visual code generation systems.

πŸ”₯ News

  • 2026.05: Β πŸŽ‰πŸŽ‰ Our paper VAnim has been accepted by ICML 2026!
  • 2025.07: Β πŸŽ‰πŸŽ‰ Our paper GroupSketch has been accepted by ACM MM 2025!
  • 2025.02: Β πŸŽ‰πŸŽ‰ Our paper LLM4SVG has been accepted by CVPR 2025!

πŸ“ Publications

ICML 2026
VAnim

VAnim: Rendering-Aware Sparse State Modeling for Structure-Preserving Vector Animation

Guotao Liang, Zhangcheng Wang, Chuang Wang, Juncheng Hu, Haitao Zhou, Junhua Liu, Jing Zhang, Dong Xu, Qian Yu

paper project

TL;DR: VAnim formulates SVG animation as sparse state updates on a persistent DOM tree, combining identification-first motion planning with rendering-aware RL to generate structure-preserving vector animations from text.

International Conference on Machine Learning (ICML), 2026.

πŸ“„ Paper | 🌐 Project

ACM MM 2025
GroupSketch

Multi-Object Sketch Animation with Grouping and Motion Trajectory Priors

Guotao Liang, Juncheng Hu, Ximing Xing, Jing Zhang, Qian Yu

project GitHub stars

TL;DR: GroupSketch synthesizes multi-object sketch animations with grouping and motion trajectory priors, enabling users to create complex animations with ease.

ACM International Conference on Multimedia, 2025.

🌐 Project | πŸ“ Code

CVPR 2025
LLM4SVG

Empowering LLMs to Understand and Generate Complex Vector Graphics

Ximing Xing, Juncheng Hu, Guotao Liang, Jing Zhang, Dong Xu, Qian Yu

project dataset

TL;DR: LLM4SVG introduces learnable SVG Semantic Tokens and a large SVGX-SFT dataset, enabling LLMs to understand and generate complex vector graphics.

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

🌐 Project | πŸ“ Code | πŸ€— SVGX-SFT-1M Dataset

πŸ“– Educations

  • 2024.09 – Present: M.S. in Software Engineering, School of Software, Beihang University

πŸ’» Internships

  • 2026.03 – Present, Ubiquant Investment, Algorithm Intern, iQuest Foundation Large Language Model Team β€” Beijing, China
    Working on code generation and post-training for code agents within the iQuest foundation large language model team.

  • 2025.10 – 2026.02, Tencent Yuanbao, Algorithm Intern, Yuanbao Search Department β€” Beijing, China
    Working on generative ranking and retrieval with multimodal large language models.

  • 2025.05 – 2025.09, 4Paradigm, Research Intern β€” Beijing, China
    Conducted research on multimodal large language models for vector graphics animation.