Papperoni

2023

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Ziyi Wu, Y. Hu, Weijia Shi, N. Dziri, A. Suhr, P. Ammanabrolu, Noah A. Smith, M. Ostendorf, Hananneh Hajishirzi

citations

Citation Graph

Loading graph...

References [0]

Sort:

Filter:

No references match the current filters.

Cited by

papers in your library

Cites

papers in your library

Notes