An Empirical Study of Multimodal Model Merging
Yi-Lin Sung, Linjie Li, Kevin Lin, Zhe Gan, Mohit Bansal, Lijuan Wang
Findings: Language Grounding to Vision, Robotics and Beyond Findings Paper
Poster_Demo_Industry_Findings Virtual 5: Language Grounding to Vision, Robotics and Beyond (Poster)
Conference Room: Virtual-Gathertown
Conference Time: December 09, 11:00-12:30 (+08) (Asia/Singapore)
Global Time: December 09, 03:00-04:30 (UTC)
Abstract: