Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives.
Published in arXiv (2024), 2024
Recommended citation: Sun, Hao*, Yunyi Shen*, and Jean-Francois Ton. "Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives." arXiv preprint arXiv:2411.04991 (2024). https://arxiv.org/abs/2411.04991