KDD 2026 – Mining Intrinsic Rewards from LLM Hidden States for Efficient Best-of-N Sampling
Jizhou Guo: Zhiyuan College, Shanghai Jiao Tong University, University of Illinois Chicago; Zhaomin Wu: Department of Computer Science, National University of Singapore; Hanchen Yang: Tongji University; Philip Yu: University of Illinois Chicago
