OpenDinq
An open-source product alpha for evidence-backed AI-native profiles, card workspaces, and explainable people discovery.
View repository
Multimodal Agents · Visual Generation
I am an undergraduate student at South China University of Technology. My research focuses on multimodal agent systems that turn human intent into controllable visual and interactive content. I am broadly interested in agents for image generation, video generation, 3D/4D generation, visual reasoning, and iterative editing: systems that can decompose creative tasks, coordinate specialized models or tools, and improve outputs through planning, feedback, and self-reflection. I work closely with Jinxiu Liu.
Multimodal Agents for Visual Creation
Research projects across multimodal agents, controllable image/video generation, motion transfer, 3D mesh generation, visual reasoning, and 2D/3D/4D creation workflows.
Selected papers will be added here.
Multi-Agent Blender Generation System
A system for 2D/3D/4D generation in Blender that coordinates visual reasoning agents and symbolic-program agents to turn visual intent into structured, editable scenes, inspired by vision-as-inverse-graphics workflows.
Repository pending
Meritorious Winner
Mathematical Contest in Modeling / Interdisciplinary Contest in Modeling.
Second Prize
Contemporary Undergraduate Mathematical Contest in Modeling, Guangdong Division.