About Me
Hello! I’m Wenbo (Vito) Zhu, currently the Founding AI Lead & Head of AI Research at OpusClip.
I joined when the team had around 5 engineers, failed and pivoted, and now lead a 30+ cross-functional AI team delivering next-generation multimodal and generative video products.
I initiated, led, and shipped several flagship systems, including:
- OpusClip – No.1 AI Clipping Tool in the market
- ClipAnything – First multi-modal clipping tool with prompting capabilities
- Agent Opus – First video agent tailored for social media
Before OpusClip, I was a Senior ML Engineer at ByteDance/TikTok, where I served as a founding engineer of Gauthmath — building the world’s first AI-based geometry solver with 100M+ downloads.
Earlier, as a Research Scientist at Cloudwalk Technology, I developed a billion-scale face clustering engine deployed across 10+ cities (patented).
- Recognition: Both OpusClip and Gauthmath were recognized by Andreessen Horowitz (a16z) as Top 50 GenAI Apps.
- Leadership: I founded the Opus AI Research Team, the research team at OpusClip.
- Collaboration: I work with Prof. Xu Yang on multimodal video intelligence and robust vision-language alignment.
- Education: I hold a Master (OR) from UC Berkeley and an Undergrad (IE & Math) from Beihang University.
Career
- [2022–Present] Serving as Founding AI Lead & Head of AI Research at OpusClip
- [2020–2022] Joined ByteDance/TikTok as a Senior ML Engineer
- [2019–2020] Started as a Research Scientist at Cloudwalk Technology
Research Interests
- Multimodal Video Intelligence: understanding, reasoning, and editing for video content
- Agentic Systems: LLM-based planning, tool use, and evaluation frameworks
- Generative Media: automatic video repurposing and multimodal content creation
News
- [Jan. 2026] One paper is accepted by ICLR 2026.
- [Sept 2025] Two papers accepted by NeurIPS 2025.
- [May 2025] One paper accepted to ACL 2025.
- [Feb 2025] Two papers accepted to CVPR 2025, including One Highlight.
- [Jan 2025] One journal paper accepted to BIT
- [Dec 2024] Two papers accepted to AAAI 2025.
- [Nov 2024] Co-authored an article with Google AI team on Gemini Flash:
“OpusClip achieves 30% cost savings with Gemini Flash” (Google AI Showcase).
- [Feb 2024] One journal paper accepted to IJHCI
Publications
-
ICLR
Yongliang Wu*, Yizhou Zhou*, Zhou Ziheng, Yingzhe Peng, Xinyu Ye, Xinting Hu, Wenbo Zhu, Lu Qi, Ming-Hsuan Yang, Xu Yang
International Conference on Learning Representations (ICLR), 2026.
PDF
Code
Integrated by ms-swift, trl, llama-factory.
-
NeurIPS
Yongliang Wu, Zonghui Li, Xinting Hu, Xinyu Ye, Xianfang Zeng, Gang Yu, Wenbo Zhu, Bernt Schiele, Ming-Hsuan Yang, Xu Yang
Advances in Neural Information Processing Systems (NeurIPS), 2025.
-
CVPR
Yongliang Wu*, Xinting Hu*, Yuyang Sun, Yizhou Zhou, Wenbo Zhu, Fengyun Rao, Bernt Schiele, Xu Yang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
-
AAAI
Yongliang Wu*, Shiji Zhou*, Mingzhuo Yang, Lianzhe Wang, Wenbo Zhu, Heng Chang, Xinting Hu, Xiao Zhou, Xu Yang
AAAI Conference on Artificial Intelligence (AAAI), 2025.
-
AAAI
Yongliang Wu*, Wenbo Zhu*, Jiawang Cao*, Yi Lu, Bozheng Li, Weiheng Chi, Zihan Qiu, Lirian Su, Haolin Zheng, Jay Wu, Xu Yang
AAAI Conference on Artificial Intelligence (AAAI), 2025.
-
CVPR
Bozheng Li, Yongliang Wu, Yi Lu, Jiashuo Yu, Licheng Tang, Jiawang Cao, Wenqing Zhu, Yuyang Sun, Jay Wu, Wenbo Zhu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
-
ACL
Yi Lu*, Jiawang Cao*, Yongliang Wu*, Bozheng Li, Licheng Tang, Yangguang Ji, Chong Wu, Jay Wu, Wenbo Zhu
Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
Competition & Awards
- [Oct 2025] 🥈 2nd Place – Perception Test Challenge 2025 (Task 5: Hour-Long Video QA)
- [Aug 2025] 🥉 3rd Place – CVPR 2025 SoccerNet Challenge (Multi-View Foul Recognition)
- [Jun 2025] 🥇 1st Place – CVPR 2025 VidLLMs Challenge (Multilingual Video Reasoning)
- [Jun 2025] 🥈 2nd Place – CVPR 2025 VidLLMs Challenge (Complex Video Reasoning & Robustness)
- [Oct 2024] 🏆 Winner – ECCV 2024 Perception Challenge (Hour-Long Video QA Track)
- [Jun 2024] 🏆 Winner – CVPR 2024 LOVEU Workshop (Long-Term Video QA Track)
- [2020] 🥈 Kaggle ASHRAE Great Energy Predictor III – Silver Medal (Top 2%)
Services
Conference Reviewers
Powered by Jekyll and Minimal Light theme.