Weighted-Reward Preference Optimization for Implicit Model Fusion

Published in ICLR, 2025

Recommended citation: Ziyi Yang, Fanqi Wan, Longguang Zhong, Tianyuan Shi, Xiaojun Quan. (2025). "Weighted-Reward Preference Optimization for Implicit Model Fusion." ICLR 2025.
Download Paper