ITCS 6101/8101: Paper Presentation Schedule
Spring 2026
April 2:
  1. The Pitfalls of Next-Token Prediction, Bachmann and Nagarajan, ICML, 2024
  2. Top-nσ: Not All Logits Are You Need, Tang et al., arXiv, 2024
  3. Roll the Dice & Look Before You Leap: Going Beyond the Creative Limits of Next-Token Prediction, Nagarajan et al., ICML, 2025

April 7:
  1. Fast Inference from Transformers via Speculative Decoding, Leviathan et al., ICML, 2023

April 9:
  1. CopySpec: Accelerating LLMs with Speculative Copy-and-Paste, Dumitru et al., EMNLP, 2025
  2. Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations, Matton et al., ICLR, 2025
  3. Measuring Chain of Thought Faithfulness by Unlearning Reasoning Steps, Tutek et al., EMNLP, 2025

April 14:
  1. Learning to (Learn at Test Time): RNNs with Expressive Hidden States, Sun et al., ICML, 2025
  2. Less is More: Recursive Reasoning with Tiny Networks, Jolicoeur-Martineau, arXiv, 2025
  3. Recursive Language Models, Zhang et al., arXiv, 2025

April 16:
  1. LLM-Enhanced Score Function Evolution for Causal Structure Learning, Wang et al., IJCAI, 2025
  2. Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models, Marks et al., ICLR, 2025
  3. Circuit Tracing: Revealing Computational Graphs in Language Models, Ameisen et al., Anthropic Transformer Circuits Thread, 2025

April 21:
  1. Automated Design of Agentic Systems, Hu, Lu, and Clune, ICLR, 2025
  2. Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems, Feng et al., ICLR, 2025
  3. Weight-Sparse Transformers Have Interpretable Circuits, Gao et al., arXiv, 2025

April 23:
  1. A Decoder-Only Foundation Model for Time-Series Forecasting, Das et al., ICML, 2024
  2. Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free, Qiu et al., NeurIPS, 2025
  3. Large Language Diffusion Models, Nie et al., NeurIPS, 2025

April 28:
  1. LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale, Dettmers et al., NeurIPS, 2022
  2. Defeating Nondeterminism in LLM Inference, He et al., Thinking Machines Blog, 2025
  3. Matryoshka Representation Learning, Kusupati et al., NeurIPS, 2022