Seminar Lunch Series

Weak-to-Strong Generalization Even in Random Feature Networks, Provably
Zhiyuan Li, Toyota Technological Institute at Chicago
Mar 20, 2025
Vanishing Gradients in Reinforcement Finetuning of Language Models and Interpretability Insights for Localized Fine-tuning on LLM Representations
Sep 5, 2024
Speakers
Spectral State Space Models
Naman Agarwal, Google AI Princeton
Apr 25, 2024
Interpreting Modeling Training
Angelica Chen, New York University
Apr 18, 2024
Algorithmic LLM alignment
Michal Valko, Google Deep Mind
Feb 1, 2024
Reinforcement Learning from Human Feedback: Algorithms & Applications
Event organized by PLI
Nov 29, 2023
Speaker
Making LLMs Right
Event organized by PLI
Nov 9, 2023
Speaker