2023

Efficient Memory Management for Large Language Model Serving With Pagedattention

W. Kwon, Zhiyuan Li, S. Zhuang, Y. Sheng, L. Zheng, C. H. Yu, Joseph Gonzalez, Haowei Zhang, Ion Stoica

citations

Cite Score

78

Citation Graph

Loading graph...

References [0]

Sort:
Filter:

No references match the current filters.

Cited by

5

papers in your library

Cites

0

papers in your library

Notes

Tags

canon

Paper Aliases

No aliases