mentorship
This page is an archive of former interns and predoctoral researchers I mentored during my time at Ai2. I’m no longer at Ai2 and am not currently recruiting through this page.
Research Internship
I’ve had the pleasure of working with some amazing interns:
2025
- Mayee Chen, PhD student, Stanford University
- Olmix: A Framework for Data Mixing Throughout LM Development (arXiv)
- OLMo 3 (arXiv)
- Yapei Chang, PhD student, University of Massachusetts Amherst
- Amanda Bertsch, PhD student, Carnegie Mellon University
- OLMo 3 (arXiv)
- Alexis Ross, PhD student, Massachusetts Institute of Technology
2024
- Chaitanya Malaviya, PhD student, University of Pennsylvania
- Alexander Wettig, PhD student, Princeton University
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation (ICML 2025)
- OLMo 3 (arXiv)
- Niklas Muennighoff, PhD student, Stanford University
- OLMoE: Open Mixture-of-Experts Language Models (NeurIPS 2024)
- Lucy Li, PhD student, University of California Berkeley
- MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula (EMNLP Findings 2024)
- DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students’ Hand-Drawn Math Images (NAACL 2025) 🏆 Outsanding Paper Award
- Rose Wang, PhD student, Stanford University
- Vishakh Padmakumar, PhD student, New York University
- Intent-Aware Schema Generation And Refinement For Literature Review Tables (EMNLP Findings 2025)
2023
- Orion Weller, PhD student, Johns Hopkins University
- Hyunji Lee, MS ▸ incoming PhD student, KAIST
- Fangyuan Xu, PhD student, University of Texas at Austin
- Hang Jiang, PhD student, MIT
2022
- John Giorgi, PhD student, University of Toronto
- Towards multi-document summarization in the open-domain (EMNLP Findings 2023)
- Kalpesh Krishna, PhD student, University of Massachusetts Amherst
- LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization (EACL 2023) 🏆 Outsanding Paper Award
- Catherine Chen, PhD student, University of California Berkeley
- Hancheng Cao, PhD student, Stanford University
2021
- Tal August, PhD student, University of Washington
- Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing (ACM Transactions on Human-Computer Interaction)
- Dustin Wright, PhD student, University of Copenhagen
2020
- Anne Lauscher, PhD student, University of Mannheim
- Marissa Radensky, PhD student, University of Washington
- Exploring the Role of Local and Global Explanations in Recommender Systems (CHI Extended Abstracts 2022)
2019
- Andrew Head, PhD student, University of California Berkeley
- Benjamin Charles Germain Lee, PhD student, University of Washington
- LIMEADE: From AI Explanations to Advice Taking (ACM Transactions on Interactive Intelligent Systems)
- David Wadden, PhD student, University of Washington
- Fact or Fiction: Verifying Scientific Claims (EMNLP 2020)
2018
- Amandalynne Paullada, PhD student, University of Washington
Predoctoral Research
My current and former predoc researchers:
- David Heineman
- Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation (NeurIPS 2025 Datasets and Benchmarks)
- OLMo 3 (arXiv)
- Benjamin Newman → PhD student at University of Washington (2023)
- Zejiang Shen → PhD student at MIT (2022)
- VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups (Transactions of the ACL 2022)
- Sonia Murthy → PhD student at Harvard (2022)
- ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts (EMNLP System Demo 2023)
- Haokun Liu → PhD student at University of North Carolina at Chapel Hill (2021)
- Isabel Cachola → PhD student at Johns Hopkins University (2020)
- TLDR: Extreme Summarization of Scientific Documents (EMNLP Findings 2020)