Papperoni

2024

Deepseek-prover-v1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

H. Xin, Z. Z. Ren, Junxiao Song, Zhihong Shao, W. Zhao, Haiming Wang, Bing Liu, Li Zhang, X. Lu, Q. Du, W. Gao, Qihao Zhu, Diyi Yang, Z. Gou, Z. F. Wu, F. Luo, C. Ruan

Google Scholar

citations

Citation Graph

Loading graph...

References [0]

Sort:

Filter:

No references match the current filters.

Cited by

papers in your library

Cites

papers in your library

Notes