Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Hang Zhang, Xin Li, Lidong Bing

Demo: Demo Demo Paper

Poster_Demo_Industry_Findings In-person 5: Demo (Poster)
Conference Room: East Foyer
Conference Time: December 09, 11:00-12:30 (+08) (Asia/Singapore)
Global Time: December 09, Poster_Demo_Industry_Findings In-person 5 (03:00-04:30 UTC)
TLDR:
You can open the #paper-Demo-232 channel in a separate window.
Abstract: