Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives.
Published in arXiv (2024), 2025
Recommended citation: Sun, Hao*, Yunyi Shen*, and Jean-Francois Ton. "Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives." arXiv preprint arXiv:2411.04991 (2024), to appear in ICLR 2025 (oral). https://arxiv.org/abs/2411.04991