On the Benefits of Learning to Route in Mixture-of-Experts Models

Nishanth Dikkala, Nikhil Ghosh, Raghu Meka, Rina Panigrahy, Nikhil Vyas, Xin Wang

Main: Theme Track: Large Language Models and the Future of NLP Main-poster Paper

Poster_Demo_Industry Hybrid 4: Theme Track: Large Language Models and the Future of NLP (Poster)
Conference Room: East Foyer(Virtual)
Conference Time: December 09, 09:00-10:30 (+08) (Asia/Singapore)
Global Time: December 09, Poster_Demo_Industry Hybrid 4 (01:00-02:30 UTC)
TLDR:
You can open the #paper-1124 channel in a separate window.
Abstract: