Cite
Notes
Only stored in your browser.
Attribution
Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators
arXiv 2026
from 1 papers
Ed H. Chi
researcher
Isay Katsman
Lichan Hong
Lukasz Heldt
Mingyan Gao
Onkar Dalal
Raghunandan Keshavan
Ruining He
Shao-Chuan Wang
Xinyang Yi