Dr. Daniil Mirylenka, Google DeepMind
The Feedback Loop of Intelligence: Operationalizing User Signals in Frontier Model Alignment
Explored how production telemetry and implicit user signals drive the development of frontier LLMs, examining the engineering pipelines that turn noisy user behavior into training data for reward models and the challenges that arise along the way, such as sycophancy and reward hacking.
