Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Viet Lai, Chien Nguyen, Nghia Ngo, Thuat Nguyen, Franck Dernoncourt, Ryan Rossi, Thien Nguyen

Demo: Demo Demo Paper

Poster_Demo_Industry Hybrid 3: Demo (Poster)
Conference Room: East Foyer(Virtual)
Conference Time: December 08, 16:00-17:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry Hybrid 3 (08:00-09:30 UTC)
TLDR:
You can open the #paper-Demo-187 channel in a separate window.
Abstract: