2024

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

H. Xia, Zhilin Yang, Q. Dong, Peng Wang, Yiwei Li, Tiezheng Ge, T. Liu, Wentao Li, Zhifang Sui

citations

Citation Graph

Loading graph...

References [0]

Sort:
Filter:

No references match the current filters.

Cited by

1

papers in your library

Cites

0

papers in your library

Notes

Tags

Paper Aliases

No aliases