Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model

Zeyu Liu, Tim Dettmers, Xi Victoria Lin, Veselin Stoyanov, Xian Li

Main: Language Modeling and Analysis of Language Models Main-poster Paper

Poster_Demo_Industry_Findings In-person 2: Language Modeling and Analysis of Language Models (Poster)
Conference Room: East Foyer
Conference Time: December 08, 14:00-15:30 (+08) (Asia/Singapore)
Global Time: December 08, Poster_Demo_Industry_Findings In-person 2 (06:00-07:30 UTC)
Abstract: