About
Hello! I'm Wenbo (Vito) Zhu, currently the Founding AI Lead & Head of AI Research at OpusClip. I joined when the team had around 5 engineers, failed and pivoted, and now lead a 30+ cross-functional AI team delivering next-generation multimodal and generative video products. I initiated, led, and shipped several flagship systems, including:
Before OpusClip, I was a Senior ML Engineer at ByteDance/TikTok, where I served as a founding engineer of Gauthmath — building the world's first AI-based geometry solver with 100M+ downloads.
Background
Earlier, as a Research Scientist at Cloudwalk Technology, I developed a billion-scale face clustering engine deployed across 10+ cities (patented).
Career
OpusClip
ByteDance / TikTok
Cloudwalk Technology
Research
Understanding, reasoning, and editing for video content
LLM-based planning, tool use, and evaluation frameworks
Automatic video repurposing and multimodal content creation
Projects
Generalization of SFT via Reward Rectification
Temporal Grounding Videos like Flipping Manga
Video Understanding with GPT-4V(ision)
Text-Prompted Video Object Tracking
Video Repurposing from User Generated Content
News
Publications
Awards