Cite
Notes
Only stored in your browser.
Attribution
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
arXiv 2025
Multi-marginal Schrödinger Bridges with Iterative Reference Refinement
arXiv 2024
from 2 papers
Hao Sun
Jean-Francois Ton
Mihaela van der Schaar
Renato Berlinghieri
Tamara Broderick