Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives.

Published in arXiv (2024), 2024

Recommended citation: Sun, Hao*, Yunyi Shen*, and Jean-Francois Ton. "Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives." arXiv preprint arXiv:2411.04991 (2024). https://arxiv.org/abs/2411.04991