2024
Gated Linear Attention Transformers With Hardware-Efficient Training
Shusheng Yang, B. Wang, Y. Shen, R. Panda, Yoon Kim
Citation Graph
References [0]
No references match the current filters.
Cited by
1
papers in your library
Cites
0
Add to reading list
Notes
Tags
Paper Aliases
No aliases