Weighted-Reward Preference Optimization for Implicit Model Fusion
Published in ICLR, 2025
Recommended citation: Ziyi Yang, Fanqi Wan, Longguang Zhong, Tianyuan Shi, Xiaojun Quan. (2025). "Weighted-Reward Preference Optimization for Implicit Model Fusion." ICLR 2025.
Download Paper
