Self-Evolution Fine-Tuning for Policy Optimization

Published in EMNLP, 2024

Recommended citation: Ruijun Chen, Jiehao Liang, Shiping Gao, Fanqi Wan, Xiaojun Quan. (2024). "Self-Evolution Fine-Tuning for Policy Optimization." EMNLP 2024.