2025

Muon Is Scalable for LLM Training

Joseph Liu, Jianlin Su, Xingcheng Yao, Zhejun Jiang, Guokun Lai, Yulun Du, Y. Qin, Weixin Xu, Enzhe Lu, Junjie Yan, Yanru Chen, Huabin Zheng, Yibo Liu, Shuming Liu, Bohong Yin, Weiran He, H. Zhu, Yuzhi Wang, J. Wang, Mengnan Dong, Zhengyou Zhang, Y. Kang, Haowei Zhang, Xinran Xu, Y. Z. Zhang, Yonghui Wu, Xinyu Zhou, Zhilin Yang

citations

Citation Graph

Loading graph...

References [0]

Sort:
Filter:

No references match the current filters.

Cited by

1

papers in your library

Cites

0

papers in your library

Notes

Tags

Paper Aliases

No aliases