STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
Chen Chen, Bowen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Madappally Jose, Alexander T Toshev, Yantao Zheng, Jonathon Shlens, Ruoming Pang, Yinfei Yang
Main: Speech and Multimodality Main-poster Paper
Poster_Demo_Industry_Findings In-person 3: Speech and Multimodality (Poster)
Conference Room: East Foyer
Conference Time: December 08, 16:00-17:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry_Findings In-person 3 (08:00-09:30 UTC)
TLDR:
You can open the
#paper-296
channel in a separate window.
Abstract: