IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad Ayyubi, Kai-Wei Chang, Shih-Fu Chang

Findings: Language Grounding to Vision, Robotics and Beyond Findings Paper

Poster_Demo_Industry_Findings Virtual 3: Language Grounding to Vision, Robotics and Beyond (Poster)
Conference Room: Virtual-Gathertown
Conference Time: December 08, 16:00-17:30 (+08) (Asia/Singapore)
Global Time: December 08, 08:00-09:30 UTC
Abstract: