Transcending Scaling Laws with 0.1% Extra Compute

Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David So, Siamak Shakeri, Xavier Garcia, Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V Le, Mostafa Dehghani

Main: Language Modeling and Analysis of Language Models Main-poster Paper

Poster_Demo_Industry_Findings In-person 2: Language Modeling and Analysis of Language Models (Poster)
Conference Room: East Foyer
Conference Time: December 08, 14:00-15:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry_Findings In-person 2 (06:00-07:30 UTC)
TLDR:
You can open the #paper-4436 channel in a separate window.
Abstract: