ITCS 6101/8101: Paper Presentation Schedule
Spring 2026
April 2:
- The Pitfalls of Next-Token Prediction, Bachmann and Nagarajan, ICML, 2024
- Presenters: Fatemeh Rajabi and Mehjabeen T Shaikh
- Presentation slides
- Top-nσ: Not All Logits Are You Need, Tang et al., arXiv, 2024
- Presenters: Robert Figueroa and Jamison Heinrich
- Presentation slides
- Roll the Dice & Look Before You Leap: Going Beyond the Creative Limits of Next-Token Prediction, Nagarajan et al., ICML, 2025
- Presenter: Steffy Roselina Eben Judson
- Presentation slides
April 7:
- Fast Inference from Transformers via Speculative Decoding, Leviathan et al., ICML, 2023
- Presenters: Daniel Meza Sarmiento and Abel Varghese
- Presentation slides
April 9:
- CopySpec: Accelerating LLMs with Speculative Copy-and-Paste, Dumitru et al., EMNLP, 2025
- Presenter: Dhrubo Mahbub
- Presentation slides
- Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations, Matton et al., ICLR, 2025
- Presenters: Arham Hussain Inamdar and Kevin Richard
- Presentation slides
- Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps, Tutek et al., EMNLP, 2025
- Presenters: Cameron Detig and Gowshik Saravanan
- Presentation slides
April 14:
- Learning to (Learn at Test Time): RNNs with Expressive Hidden States, Sun et al., ICML, 2025
- Presenters: Sergio Rodriguez and Kashyap Suthar
- Presentation slides
- Less is More: Recursive Reasoning with Tiny Networks, Jolicoeur-Martineau, arXiv, 2025
- Presenters: Andy Ha and Micheal Splitz
- Presentation slides
- Recursive Language Models, Zhang et al., arXiv, 2025
- Presenter: Pritom Kumar Paul
- Presentation slides
April 16:
- LLM-Enhanced Score Function Evolution for Causal Structure Learning, Wang et al., IJCAI, 2025
- Presenter: Jiaxiang Zhang and Justin Smith
- Presentation slides
- Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models, Marks et al., ICLR, 2025
- Presenters: Matt Majeske and Ethan Nguyen
- Presentation slides
- Circuit Tracing: Revealing Computational Graphs in Language Models, Ameisen et al., Anthropic Transformer Circuits Thread, 2025
- Presenter: Conor Miller-Lynch
- Presentation slides
April 21:
- Automated Design of Agentic Systems, Hu, Lu, and Clune, ICLR, 2025
- Presenters: David Caballero and Pablo Barona
- Presentation slides
- Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems, Feng et al., ICLR, 2025
- Presenters: Abhinav Biju and Eric Fackelman
- Presentation slides
- Weight-Sparse Transformers Have Interpretable Circuits, Gao et al., arXiv, 2025
- Presenter: Saajan Patel
- Presentation slides
April 23:
- A Decoder-Only Foundation Model for Time-Series Forecasting, Das et al., ICML, 2024
- Presenters: Nick Ochsner and Vance Ayscue
- Presentation slides
- Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free, Qiu et al., NeurIPS, 2025
- Presenters: Anant Teotia and Pradhyumna Kothapalli
- Presentation slides
- Large Language Diffusion Models, Nie et al., NeurIPS, 2025
- Presenters: Joshua Foster and Andrew Morgan
- Presentation slides
April 28:
- LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale, Dettmers et al., NeurIPS, 2022
- Presenter: Vincent Ma
- Presentation slides
- Defeating Nondeterminism in LLM Inference, He et al., Thinking Machines Blog, 2025
- Presenter: Sabrin Nowrin
- Presentation slides
- Matryoshka Representation Learning, Kusupati et al., NeurIPS, 2022
- Presenters: Tarang Sonkusare and Alphin Abraham Varghese
- Presentation slides