2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

H. Xin, Z. Z. Ren, Junxiao Song, Zhihong Shao, W. Zhao, Haiming Wang, Bing Liu, Li Zhang, X. Lu, Q. Du, W. Gao, Qihao Zhu, Diyi Yang, Z. Gou, Z. F. Wu, F. Luo, C. Ruan



