  1. OLMo: Accelerating the Science of Language Models
    Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, and 36 more authors
    ArXiv, Feb 2024
  2. Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
    Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, and 29 more authors
    ArXiv, Jan 2024


  1. Paloma: A Benchmark for Evaluating Language Model Fit
    Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, and 9 more authors
    ArXiv, Dec 2023
  2. PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents
    Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chang, Russell Authur, Erin Bransom, Stefan Candra, and 10 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Dec 2023
    🏆 Best Paper Award 🏆
  3. Decomposing Complex Queries for Tip-of-the-tongue Retrieval
    Kevin Lin, Kyle Lo, Joseph Gonzalez, and Dan Klein
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
  4. A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents
    Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, and Kyle Lo
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
  5. Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval
    John Giorgi, Luca Soldaini, Bo Wang, Gary Bader, Kyle Lo, Lucy Wang, and Arman Cohan
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Dec 2023
  6. Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders
    Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, and Kyle Lo
    ArXiv, Nov 2023
  7. BooookScore: A systematic exploration of book-length summarization in the era of LLMs
    Yapei Chang, Kyle Lo, Tanya Goyal, and Mohit Iyyer
    ArXiv, Oct 2023
  8. The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices
    Hancheng Cao, Jesse Dodge, Kyle Lo, Daniel A. McFarland, and Lucy Lu Wang
    ArXiv, Oct 2023
  9. When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets
    Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, and Luca Soldaini
    ArXiv, Sep 2023
  10. Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation
    Hao Peng, Qingqing Cao, Jesse Dodge, Matthew E. Peters, Jared Fernandez, Tom Sherborne, Kyle Lo, and 7 more authors
    ArXiv, Jul 2023
  11. Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents
    Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, and Kyle Lo
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  12. Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction
    Anna Martin-Boyle, Andrew Head, Kyle Lo, Risham Sidhu, Marti A. Hearst, and Dongyeop Kang
    ArXiv, May 2023
  13. LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization
    Kalpesh Krishna, Erin Bransom, Bailey Kuehl, Mohit Iyyer, Pradeep Dasigi, Arman Cohan, and Kyle Lo
    In EACL, May 2023
    🏆 Outstanding Paper Award 🏆
  14. Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks
    Zejiang Shen, Tal August, Pao Siangliulue, Kyle Lo, Jonathan Bragg, Jeff Hammerbacher, Doug Downey, and 2 more authors
    In Intelligent and Interactive Writing Assistants (In2Writing) Workshop, Apr 2023
  15. CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context
    Joseph Chee Chang, Amy X. Zhang, Jonathan Bragg, Andrew Head, Kyle Lo, Doug Downey, and Daniel S. Weld
    In CHI, Apr 2023
    🏆 Best Paper Award 🏆
  16. Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing
    Tal August, Lucy Lu Wang, Jonathan Bragg, Marti A. Hearst, Andrew Head, and Kyle Lo
    ACM Transactions on Computer-Human Interaction (TOCHI), Apr 2023
  17. The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces
    Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, and 48 more authors
    ArXiv, Mar 2023
  18. Scim: Intelligent Skimming Support for Scientific Papers
    Raymond Fok, Hita Kambhamettu, Luca Soldaini, Jonathan Bragg, Kyle Lo, Marti Hearst, Andrew Head, and 1 more author
    In IUI, Mar 2023
  19. LIMEADE: From AI Explanations to Advice Taking
    Benjamin Charles Germain Lee, Doug Downey, Kyle Lo, and Daniel S. Weld
    ACM Transactions on Interactive Intelligent Systems, Mar 2023
  20. The Semantic Scholar Open Data Platform
    Rodney Michael Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, and 41 more authors
    ArXiv, Jan 2023


  1. SciFact-Open: Towards open-domain scientific claim verification
    David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Iz Beltagy, Lucy Lu Wang, and Hannaneh Hajishirzi
    In Findings of EMNLP, Dec 2022
  2. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
    Teven Le Scao, Angela Fan, Christopher Akiki, Elizabeth-Jane Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, and 383 more authors
    ArXiv, Nov 2022
  3. Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities
    Zejiang Shen, Kyle Lo, Lauren Yu, Nathan Dahlberg, Margo Schlanger, and Doug Downey
    In NeurIPS (Datasets and Benchmarks), Nov 2022
  4. Overview of the Third Workshop on Scholarly Document Processing
    Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Drahomira Herrmannova, Petr Knoth, Kyle Lo, and 4 more authors
    In Scholarly Document Processing (SDP) Workshop, Oct 2022
  5. MultiVerS: Improving scientific claim verification with weak supervision and full-document context
    David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, and Hannaneh Hajishirzi
    In Findings of NAACL, Jul 2022
  6. MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting
    Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, Arman Cohan, David Jurgens, and Kyle Lo
    In NAACL, Jul 2022
  7. Automatic question answering for multiple stakeholders, the epidemic question answering dataset
    Travis R. Goodwin, Dina Demner-Fushman, Kyle Lo, Lucy Lu Wang, Hoa T. Dang, and Ian M. Soboroff
    Scientific Data, Jul 2022
  8. Data Governance in the Age of Large-Scale Data-Driven Language Technology
    Yacine Jernite, Huu Nguyen, Stella Biderman, Anna Rogers, Maraim Masoud, Valentin Danchev, Samson Tan, and 13 more authors
    In FAccT, Jun 2022
  9. The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
    Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, and 47 more authors
    In NeurIPS (Datasets and Benchmarks), May 2022
  10. Generating Scientific Claims for Zero-Shot Scientific Fact Checking
    Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, and Lucy Lu Wang
    In ACL, May 2022
  11. VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups
    Zejiang Shen, Kyle Lo, Lucy Lu Wang, Bailey Kuehl, Daniel S. Weld, and Doug Downey
    Transactions of ACL (TACL), May 2022
  12. ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts
    Sonia K. Murthy, Kyle Lo, Daniel King, Chandra Bhagavatula, Bailey Kuehl, Sophie Johnson, Jon Borchardt, and 3 more authors
    ArXiv, May 2022
  13. Exploring the Role of Local and Global Explanations in Recommender Systems
    Marissa Radensky, Doug Downey, Kyle Lo, Zoran Popovic, and Daniel S. Weld
    In CHI (Extended Abstracts), Apr 2022
  14. Infrastructure for rapid open knowledge network development
    Michael Cafarella, Michael Anderson, Iz Beltagy, Arie Cattan, Sarah Chasins, Ido Dagan, Doug Downey, and 19 more authors
    AI Magazine, Mar 2022


  1. FLEX: Unifying Evaluation for Few-Shot NLP
    Jonathan Bragg, Arman Cohan, Kyle Lo, and Iz Beltagy
    In NeurIPS, Dec 2021
  2. Explaining Relationships Between Scientific Documents
    Kelvin Luu, Xinyi Wu, Rik Koncel-Kedziorski, Kyle Lo, Isabel Cachola, and Noah A. Smith
    In ACL, Aug 2021
  3. A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers
    Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith, and Matt Gardner
    In NAACL, Jun 2021
  4. Overview and Insights from the SCIVER shared task on Scientific Claim Verification
    David Wadden and Kyle Lo
    In Scholarly Document Processing (SDP) Workshop, Jun 2021
  5. Overview of the Second Workshop on Scholarly Document Processing
    Iz Beltagy, Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Keith Hall, Drahomira Herrmannova, and 8 more authors
    In Scholarly Document Processing (SDP) Workshop, Jun 2021
  6. Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols
    Andrew Head, Kyle Lo, Dongyeop Kang, Raymond Fok, Sam Skjonsberg, Daniel S. Weld, and Marti A. Hearst
    In CHI, May 2021
  7. Discourse Understanding and Factual Consistency in Abstractive Summarization
    Saadia Gabriel, Antoine Bosselut, Jeff Da, Ari Holtzman, Jan Buys, Kyle Lo, Asli Celikyilmaz, and 1 more author
    In EACL, Apr 2021
  8. Searching for scientific evidence in a pandemic: An overview of TREC-COVID
    Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen Voorhees, and 2 more authors
    Journal of Biomedical Informatics, Apr 2021
  9. TREC-COVID: Constructing a Pandemic Information Retrieval Test Collection
    Ellen Voorhees, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, William R. Hersh, Kyle Lo, Kirk Roberts, and 2 more authors
    SIGIR Forum, Feb 2021


  1. Text mining approaches for dealing with the rapidly expanding literature on COVID-19
    Lucy Lu Wang and Kyle Lo
    Briefings in Bioinformatics, Dec 2020
  2. Fact or Fiction: Verifying Scientific Claims
    David Wadden, Shanchuan Lin, Kyle Lo, Lucy Lu Wang, Madeleine van Zuylen, Arman Cohan, and Hannaneh Hajishirzi
    In EMNLP, Nov 2020
  3. Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions
    Dongyeop Kang, Andrew Head, Risham Sidhu, Kyle Lo, Daniel Weld, and Marti A. Hearst
    In Scholarly Document Processing (SDP) Workshop, Nov 2020
  4. TLDR: Extreme Summarization of Scientific Documents
    Isabel Cachola, Kyle Lo, Arman Cohan, and Daniel Weld
    In Findings of EMNLP, Nov 2020
  5. Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature
    Anshul Kanakia, Kuansan Wang, Yuxiao Dong, Boya Xie, Kyle Lo, Zhihong Shen, Lucy Lu Wang, and 4 more authors
    Frontiers in Research Metrics and Analytics, Nov 2020
  6. TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19
    Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen Voorhees, and 2 more authors
    Journal of the American Medical Informatics Association, Jul 2020
  7. CORD-19: The COVID-19 Open Research Dataset
    Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Doug Burdick, Darrin Eide, and 21 more authors
    In NLP for COVID-19 Workshop, Jul 2020
  8. S2ORC: The Semantic Scholar Open Research Corpus
    Kyle Lo, Lucy Lu Wang, Mark Neumann, Rodney Kinney, and Daniel Weld
    In ACL, Jul 2020
  9. Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks
    Suchin Gururangan, Ana Marasović, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, and Noah A. Smith
    In ACL, Jul 2020
    🏆 Honorable Mention for Best Paper 🏆


  1. SciBERT: A Pretrained Language Model for Scientific Text
    Iz Beltagy, Kyle Lo, and Arman Cohan
    In EMNLP, Nov 2019
  2. Quantifying Sex Bias in Clinical Studies at Scale With Automated Data Extraction
    Sergey Feldman, Waleed Ammar, Kyle Lo, Elly Trepman, Madeleine van Zuylen, and Oren Etzioni
    JAMA Network Open, Jul 2019
  3. Combining Distant and Direct Supervision for Neural Relation Extraction
    Iz Beltagy, Kyle Lo, and Waleed Ammar
    In NAACL, Jun 2019


  1. Ontology alignment in the biomedical domain using entity definitions and context
    Lucy Lu Wang, Chandra Bhagavatula, Mark Neumann, Kyle Lo, Chris Wilhelm, and Waleed Ammar
    In BioNLP Workshop, Jul 2018
  2. Construction of the Literature Graph in Semantic Scholar
    Waleed Ammar, Dirk Groeneveld, Chandra Bhagavatula, Iz Beltagy, Miles Crawford, Doug Downey, Jason Dunkelberger, and 16 more authors
    In NAACL, Jun 2018