GitHub - bytedance/Lance: A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
Lance: Unified Multimodal Modeling by Multi-Task Synergy
Fengyi Fu*,
Mengqi Huang*,✉,
Shaojin Wu*,
Yunsheng Jiang*,
Yufei Huo,
Jianzhu Guo✉,§
Hao Li,
Yinghang Song,
Fei Ding,
Qian He,
Zheren Fu,
Zhendong Mao,
Yongdong Zhang
ByteDance
* Equal contribution ✉ Corresponding authors § Project lead
English | 简体中文
🌟 Highlights
Lance is a 3B native unified multimodal model that supports image and video understanding, generation, and editing within a single framework.
Efficient at 3B scale. With o...
Read more at github.com