REFORM Schedule and Past Meetings
Upcoming and past REFORM sessions, papers, and discussion materials.
Meetings are held every Thursday at 5 PM. Room: CoDa E401 (exception: on April 23rd, we meet in CoDa W101)
Spring 2026 theme: Understanding and Improving LLMs via a Theoretical Lens. This quarter we plan to cover recent work on the internal structure of LLMs, compression and quantization, optimization and training methods, RL-theoretic viewpoints, and systems or algorithmic ideas for improving model performance.
Sign up to be a discussant here. Goal(s) of the discussant group:
- Prepare a 20–30 minute presentation, accessible to a second-year PhD student, focusing on (a) seeding discussion, (b) identifying gaps and connections, and (c) formulating open problems
- We suggest several papers for each week—more than one can cover thoroughly in a week. Pick a small, focused set of papers and read them thoroughly
- Do a single “deep dive” per week about one subject (this can span multiple papers)
Signing up is a great way to (1) force yourself to engage with the content of the paper, (2) get to know your co-discussant(s), and (3) ensure the success of the reading group.
Upcoming & Past Sessions
| Date | Topic | Resources |
|---|---|---|
| 2024-10-16 | Introduction | Slides |
| 2024-10-23 | Scaling Laws 1 (Training Compute-Optimal Language Models) | Paper Slides |
| 2024-10-30 | Scaling Laws 2 (Explaining Neural Scaling Laws) | Paper Slides |
| 2024-11-06 | Data Selection 1 (Perplexity Correlations, Scaling Laws + Data Filtering) | Paper 1 Paper 2 Slides |
| 2024-11-13 | Data Selection 2 (DsDm, LESS) | Paper 1 Paper 2 Tutorial Slides (DsDm) Slides (LESS) |
| 2024-11-20 | Data Selection 3 (Statistical Theory) | Paper Slides |
| 2024-11-20 | Data Selection 3 (Pruning, Prediction) | Paper 1 Paper 2 |
| 2025-01-22 | Post-training 1 (RLHF, AlpacaFarm) | Paper 1 Paper 2 Slides |
| 2025-01-29 | No meeting (ICML Deadline) | |
| 2025-02-05 | Post-training 2 (Direct methods & Offline RL) | Paper 1 Paper 2 Paper 3 Paper 4 Slides 1 Slides 2 |
| 2025-02-12 | Post-training 3 (DeepSeek) | Paper Slides |
| 2025-02-19 | Post-training 4 (Synthetic Data) | Slides |
| 2025-02-26 | Post-training 4 (Synthetic Data & Self-Improvement) | Paper 1 Paper 2 Slides |
| 2025-03-04 | Post-training 5 (Simplicity) | Paper 1 Paper 2 Slides |
| 2025-03-11 | Post-training 5 (In-Context Learning) | Slides |
| (Between-quarter break) | ||
| 2025-04-09 | Reasoning 1 (Introduction) | Slides |
| 2025-04-16 | Reasoning 2 (STaR) | Paper |
| 2025-04-23 | Reasoning 3 (Process rewards) | Slides Paper 1 Paper 2 |
| 2025-04-30 | Reasoning 4 (More Self-improvement) | Paper |
| (Summer break) | ||
| 2025-10-09 | Post-deployment/Safety 1 (CoT Monitoring) | Paper 1 Paper 2 Slides |
| 2025-10-16 | Post-deployment/Safety 2 (Jailbreaking, Elicitation) | Paper 1 Paper 2 Paper 3 Slides |
| 2025-10-23 | Post-deployment/Safety 3 (Hallucinations) | Paper 1 Paper 2 Slides 1 Slides 2 |
| 2025-10-30 | Post-deployment/Safety 3 (Privacy and Memorization) | Slides 1 Slides 2 Paper 1 Paper 2 |
| 2025-11-06 | Post-deployment/Safety 4 (Emergent Misalignment) | Paper |
| 2025-11-13 | Post-deployment/Safety 5 (Out-of-Context Reasoning) | Slides Paper 1 Paper 2 Paper 3 Paper 4 |
| 2026-01-22 | Introduction + Sharpness and Training Dynamics 1 (Edge of Stability) | Slides Paper Extra Reading 1 Extra Reading 2 |
| 2026-02-05 | Sharpness and Training Dynamics 2 (Sharpness-Aware Minimization) | Slides 1 Slides 2 Paper 1 Paper 2 |
| 2026-02-12 | Overfitting and Generalization 1: Double Descent | Slides 1 Slides 2 Paper 1 Paper 2 |
| 2026-02-19 | Overfitting and Generalization 2: Benign Overfitting | Slides 1 Slides 2 Paper 1 Paper 2 |
| 2026-02-26 | Emergent Abilities 1: Grokking | Slides 1 Slides 2 Paper 1 Paper 2 |
| 2026-03-05 | Emergent Abilities 2: Emergent Abilities | Slides 1 Slides 2 Paper 1 Paper 2 |
| 2026-03-12 | Distribution Sharpening and RL | Slides 1 Slides 2 Paper 1 Paper 2 Paper 3 |
| (Between-quarter break) | ||
| 2026-04-16 | Introduction + Internal Structure of LLMs 1 (Low-Logit Rank) | Slides Paper |
| 2026-04-23 | Internal Structure of LLMs 2 (Sublimal effects) | Slides Paper |
| 2026-04-30 | Internal Structure of LLMs 3 (Linear representation hypothesis) | Paper 1 Paper 2 |
| 2026-05-07 | LLM Compression 1 (Speeding up attention) | Paper: TBD |
| 2026-05-14 | LLM Compression 2 (Compressing Weights) | Paper: TBD |