Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs.
Published in arXiv (2025), 2025
Recommended citation: Sun, Hao, Yunyi Shen, Jean-Francois Ton, and Mihaela van der Schaar. "Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs." arXiv preprint arXiv:2502.04357 (2025). https://arxiv.org/abs/2502.04357