Specializations

  • Natural Language Processing
  • Health Informatics
  • Machine Learning for Health

Biography

Lucy Lu Wang is an Assistant Professor at the University of Washington Information School. Her research focuses on building better AI and NLP systems for extracting and understanding information from scientific texts. For example, can we create systems that leverage up-to-date literature to help us make better, more data-driven healthcare decisions, or design document understanding models that improve the readability of scientific texts for people who are blind or have low vision? Lucy’s work on supplement interaction detection, gender trends in academic publishing, COVID-19 datasets, and document understanding has been featured in GeekWire, Boing Boing, Axios, VentureBeat, and the New York Times. Prior to joining the UW, she was a Young Investigator at the Allen Institute for AI, and she received her PhD in Biomedical Informatics and Medical Education from the University of Washington.

Publications and Contributions

  • Conference Paper
    A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers (2022)
    ASSETS 2022 Authors: Sanjana Chintalapati, Jonathan Bragg, Lucy Wang
  • Journal Article, Academic Journal
    Automatic Question Answering for Multiple Stakeholders: the Epidemic Question Answering Dataset (2022)
    Nature Scientific Data Authors: Travis Goodwin, Dina Demner-Fushman, Kyle Lo, Lucy Wang, Hoa Dang, Ian Soboroff
  • Conference Paper
    Generating scientific claims for automated scientific fact checking (2022)
    ACL 2022 Authors: Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, Lucy Wang
  • Journal Article, Academic Journal
    Infrastructure for rapid open knowledge network development (2022)
    AI Magazine Authors: Michael Cafarella, Michael Anderson, Iz Beltagy, Arie Cattan, Sarah Chasins, Ido Dagan, Doug Downey, Oren Etzioni, Sergey Feldman, Tian Gao, Tom Hope, Kexin Huang, Sophie Johnson, Daniel King, Kyle Lo, Yuze Lou, Matthew Shapiro, Dinghao Shen, Shivashankar Subramanian, Lucy Wang, Yuming Wang, Yitong Wang, Daniel S. Weld, Jenny Vo-Phamhi, Anna Zeng, Jiayun Zou
  • Conference Paper
    Literature-Augmented Clinical Outcome Prediction (2022)
    NAACL Findings 2022 Authors: Aakanksha Naik, Sravanthi Parasa, Sergey Feldman, Lucy Wang, Tom Hope
  • Conference Paper
    MultiVerS: Improving scientific claim verification with weak supervision and full-document context (2022)
    NAACL Findings 2022 Authors: David Wadden, Kyle Lo, Lucy Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi
  • Preprint
    Paper Plain: Making medical research papers approachable to healthcare consumers with natural language processing (2022)
    Authors: Tal August, Lucy Wang, Jonathan Bragg, Marti A. Hearst, Andrew Head, Kyle Lo
  • Journal Article, Academic Journal
    VILA: Improving structured content extraction from scientific PDF using visual layout groups (2022)
    Transactions of the ACL Authors: Zejiang Shen, Kyle Lo, Lucy Wang, Bailey Kuehl, Daniel S. Weld, Doug Downey
  • Conference Extended Abstract
    A bibliometric analysis of citation diversity in accessibility and HCI research (2021)
    CHI Extended Abstracts 2021 Authors: Lucy Wang, Kelly Mack, Emma J. McDonnell, Dhruv Jain, Leah Findlater, Jon E. Froehlich
  • Journal Article, Academic Journal
    Gender trends in computer science authorship (2021)
    Communications of the ACM Authors: Lucy Wang, Gabriel Stanovsky, Luca Weihs, Oren Etzioni
  • Journal Article, Academic Journal
    Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World (2021)
    IEEE Internet of Things Authors: Farshad Firouzi, Bahar Farahani, Mahmoud Daneshmand, Kathy Grise, Jaeseung Song, Roberto Saracco, Lucy Wang, Kyle Lo, Plamen Angelov, Eduardo Soares et al.
  • Preprint
    Improving the accessibility of scientific documents: current state, user needs, and a system solution to enhance scientific PDF accessibility for blind and low vision users (2021)
    Authors: Lucy Wang, Isabel Cachola, Jonathan Bragg, Evie Cheng, Chelsea Haupt, Matt Latzke, Bailey Kuehl, Madeleine van Zuylen, Linda Wagner, Daniel S. Weld
  • Conference Paper
    Multi-document summarization of medical studies (2021)
    EMNLP 2021 Authors: Jay DeYoung, Iz Beltagy, Madeleine van Zuylen, Bailey Kuehl, Lucy Wang
  • Conference Paper
    Overview of the Second Workshop on Scholarly Document Processing at NAACL 2021 (2021)
    SDP at NAACL 2021 Authors: Iz Beltagy, Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Keith Hall, Drahomira Herrmannova, Petr Knoth, Kyle Lo, Philipp Mayr, Robert M. Patton, Michal Shmueli-Scheuer, Anita de Waard, Kuansan Wang, Lucy Wang
  • Conference Paper
    SciA11y: Converting scientific papers to accessible HTML (2021)
    ASSETS 2021 Authors: Lucy Wang, Isabel Cachola, Jonathan Bragg, Evie Cheng, Chelsea Haupt, Matt Latzke, Bailey Kuehl, Madeleine van Zuylen, Linda Wagner, Daniel S. Weld
  • Journal Article, Academic Journal
    Searching for scientific evidence in a pandemic: an overview of TREC-COVID (2021)
    Journal of Biomedical Informatics Authors: Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen Voorhees, Lucy Wang, William R. Hersh
  • Conference Paper
    What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019 (2021)
    CHI 2021 Authors: Kelly Mack, Emma McDonnell, Dhruv Jain, Lucy Wang, Jon E. Froehlich, Leah Findlater
  • Conference Workshop Paper
    CORD-19: the COVID-19 open research dataset (2020)
    NLP-COVID at ACL 2020 Authors: Lucy Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Doug Burdick, Darrin Eide, Kathryn Funk, Yannis Katsis, Rodney Michael Kinney, Yunyao Li, Ziyang Liu, William Merrill, Paul Mooney, Dewey A. Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, Brandon Stilson, Alex D. Wade, Kuansan Wang, Nancy Xin Ru Wang, Christopher Wilhelm, Boya Xie, Douglas M. Raymond, Daniel S. Weld, Oren Etzioni, Sebastian Kohlmeier
  • Conference Paper
    Fact or fiction: verifying scientific claims (2020)
    EMNLP 2020 Authors: David Wadden, Shanchuan Lin, Kyle Lo, Lucy Wang, Madeleine van Zuylen, Arman Cohan, Hannaneh Hajishirzi
  • Conference Paper
    MedICaT: a dataset of medical images, captions, and textual references (2020)
    EMNLP Findings 2020 Authors: Sanjay Subramanian, Lucy Wang, Ben Bogin, Sachin Mehta, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi
  • Journal Article, Academic Journal
    Mitigating biases in CORD-19 for analyzing COVID-19 literature (2020)
    Frontiers in Research Metrics and Analytics Authors: Anshul Kanakia, Kuansan Wang, Yuxiao Dong, Boya Xie, Kyle Lo, Zhihong Shen, Lucy Wang, Chiyuan Huang, Darrin Eide, Sebastian Kohlmeier, Chieh-Han Wu
  • Journal Article, Academic Journal
    Modelling kidney disease using ontology: insights from the Kidney Precision Medicine Project (2020)
    Nature Reviews Nephrology Authors: Edison Ong, Lucy Wang, Jennifer Schaub, John F. O'Toole, Becky Steck, Jonathan Himmelfarb, Ravi Iyengar, Matthias Kretzler, Sean Mooney, Yongqun He, Kidney Precision Medicine Project
  • Conference Paper
    Overview of the 2020 Epidemic Question Answering Track (2020)
    TAC 2020 Authors: Travis R. Goodwin, Dina Demner-Fushman, Kyle Lo, Lucy Wang, William R. Hersh, Hoa T. Dang, Ian M. Soboroff
  • Conference Paper
    S2ORC: the Semantic Scholar open research corpus (2020)
    ACL 2020 Authors: Kyle Lo, Lucy Wang, Mark Neumann, Rodney Kinney, Daniel S. Weld
  • Conference Paper
    SUPP.AI: finding evidence for supplement-drug interactions (2020)
    ACL Demo 2020 Authors: Lucy Wang, Oyvind Tafjord, Arman Cohan, Sarthak Jain, Sam Skjonsberg, Carissa Schoenick, Nick Botner, Waleed Ammar
  • Conference Paper
    TREC-COVID: Constructing a Pandemic Information Retrieval Test Collection (2020)
    SIGIR Forum Authors: Ellen Voorhees, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, William R. Hersh, Kyle Lo, Kirk Roberts, Ian Soboroff, Lucy Wang
  • Journal Article, Academic Journal
    TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19 (2020)
    Journal of the American Medical Informatics Association Authors: Kirk Roberts, Tasmeer Alam, Steven Bedrick, Dina Demner-Fushman, Kyle Lo, Ian Soboroff, Ellen Voorhees, Lucy Wang, William R. Hersh
  • Journal Article, Academic Journal
    Text mining approaches for dealing with the rapidly expanding literature on COVID-19 (2020)
    Briefings in Bioinformatics Authors: Lucy Wang, Kyle Lo
  • Conference Extended Abstract
    Extracting evidence of supplement-drug interactions from literature (2019)
    ML4H at NeurIPS 2019 Authors: Lucy Wang, Oyvind Tafjord, Arman Cohan, Sarthak Jain, Sam Skjonsberg, Carissa Schoenick, Nick Botner, Waleed Ammar
  • Preprint
    PhenotypeXpression: sub-classification of disease states using public gene expression data and literature (2019)
    Authors: Lucy Wang, Huaiying Lin, Xiaojun Bao, Subhajit Sengupta, Ben Busby, Robert R. Butler III
  • Journal Article, Academic Journal
    Predicting instances of Pathway Ontology classes for pathway integration (2019)
    Journal of Biomedical Semantics Authors: Lucy Wang, G. Thomas Hayman, Jennifer R. Smith, Monika Tutaj, Mary E. Shimoyama, John H. Gennari
  • Conference Paper
    Construction of the literature graph in Semantic Scholar (2018)
    NAACL Industry 2018 Authors: Waleed Ammar, Dirk Groeneveld, Chandra Bhagavatula, Iz Beltagy, Miles Crawford, Doug Downey, Jason Dunkelberger, Ahmed Elgohary, Sergey Feldman, Vu Ha, Rodney Kinney, Sebastian Kohlmeier, Kyle Lo, Tyler Murray, Hsu-Han Ooi, Matthew Peters, Joanna Power, Sam Skjonsberg, Lucy Wang, Chris Wilhelm, Zheng Yuan, Madeleine van Zuylen, Oren Etzioni
  • Conference Workshop Paper
    Ontology alignment in the biomedical domain using entity definitions and context (2018)
    BioNLP at ACL 2018 Authors: Lucy Wang, Chandra Bhagavatula, Mark Neumann, Kyle Lo, Chris Wilhelm, Waleed Ammar
  • Preprint
    PhenotypeXpression: sub-classification of disease states using public gene expression data and literature (2018)
    Authors: Lucy Wang, Huaiying Lin, Xiaojun Bao, Subhajit Sengupta, Ben Busby, Robert R. Butler III
  • Journal Article, Academic Journal
    Fluctuation analysis of peak expiratory flow and its associations with treatment failure in asthma (2017)
    American Journal of Respiratory and Critical Care Medicine Authors: David A. Kaminsky, Lucy Wang, Jason HT Bates, Cindy Thamrin, David M. Shade, Anne E. Dixon, Robert A. Wise, Stephen Peters, Charles G. Irvin
  • Conference Paper
    Similarity metrics for determining overlap among biological pathways (2017)
    ICBO 2017 Authors: Lucy Wang, John H. Gennari
  • Conference Paper
    An analysis of differences in biological pathway resources (2016)
    ICBO and BioCreative 2016 Authors: Lucy Wang, John H. Gennari, Neil F. Abernethy
  • Conference Paper
    Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination (2016)
    AMIA 2016 Authors: Hyunggu Jung, Anthony Law, Eli Grunblatt, Lucy Wang, Aaron Kusano, Jose LV Mejino Jr., Mark E. Whipple
  • Conference Paper
    Biological model development as an opportunity to provide content auditing for the Foundational Model of Anatomy ontology (2015)
    AMIA 2015 Authors: Lucy Wang, Eli Grunblatt, Hyunggu Jung, Ira J. Kalet, Mark E. Whipple
  • Journal Article, Academic Journal
    Electrical impedance myography in Duchenne muscular dystrophy and healthy controls: a multi-center study of reliability and validity (2015)
    Muscle & Nerve Authors: Craig M. Zaidman, Lucy Wang, Anne M. Connolly, Julaine Florence, Brenda L. Wong, Julie A. Parsons, Susan Apkon, Namita Goyal, Eugene Williams, Diana Escolar, Seward B. Rutkove, Jose L. Bohorquez, DART-EIM Clinical Evaluators Consortium
  • Journal Article, Academic Journal
    Assessment of alterations in the electrical impedance of muscle after experimental nerve injury via finite-element analysis (2011)
    IEEE Transactions on Biomedical Engineering Authors: Lucy Wang, Mohammad Ahad, Alistair McEwan, Jia Li, Mina Jafarpoor, Seward B. Rutkove
  • Journal Article, Academic Journal
    Electrical impedance myography for monitoring motor neuron loss in the SOD1 G93A amyotrophic lateral sclerosis rat (2011)
    Clinical Neurophysiology Authors: Lucy Wang, Andrew J. Spieker, Jia Li, Seward B. Rutkove

Presentations

  • AI and Scholarly Publishing (2022)
    Online
  • AI: Helping Us Learn to Relax and Love PDFs (2022)
    San Francisco, CA, USA
  • Generating scientific claims for automated scientific fact checking (2022)
    Dublin, Ireland
  • Identifying and Mitigating Algorithmic Biases (2022)
    School of Law, Seattle University, Seattle, WA, USA
  • Knowledge Representation and Semantics for Biomedical Knowledge Synthesis (2022)
    Online
  • Literature-Augmented Clinical Outcome Prediction (2022)
    Seattle, WA, USA
  • MultiVerS: Improving scientific claim verification with weak supervision and full-document context (2022)
    Seattle, WA, USA
  • Ontology and NLP: Bridging the ‘Structural Chasm’ (2022)
    Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
  • The Machine Element: Signals and Noise: How AI and ML Techniques are Being Deployed to Track a Global Pandemic (2022)
    Online
  • Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review (2022)
    Online
  • Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review (2022)
    Online
  • Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review (2022)
    Charlottesville, VA, USA
  • Unlocking Biomedical Knowledge: NLP Systems for Synthesizing Biomedical Evidence (2022)
    Online
  • VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups (2022)
    Dublin, Ireland
  • A bibliometric analysis of citation diversity in accessibility and HCI research (2021)
    Online
  • Biomedical Informatics Career Development (2021)
    Online
  • Fast-track Learning: Growing Insights from Text-mining COVID-19 Data (2021)
    Online
  • Mathematics in the Scholarly Literature (2021)
    Aussois, France and Online
  • MS^2: Multi-document summarization of medical studies (2021)
    Punta Cana, Dominican Republic
  • NLP and Text Mining Resources for COVID-19 and Beyond (2021)
    Online
  • Practical NLP for Biomedicine: Synthesizing Knowledge from Scientific Literature (2021)
    Online
  • Practical NLP for scientific text mining: extracting and synthesizing knowledge from the literature (2021)
    Online
  • SciA11y: Converting scientific papers to accessible HTML (2021)
    Online
  • Text Mining Insights from the COVID-19 Pandemic (2021)
    Online
  • The Power of AI: A Discussion on COVID-19 & the Future of Industries (2021)
    Online
  • The Power of AI: A Discussion on COVID-19 & the Future of Industries (2021)
    Online
  • Using Machine Learning to Verify Scientific Claims (2021)
    Online
  • What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019 (2021)
    Online
  • A SPARQL Tutorial (2020)
    Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
  • Building Community and Data Ecosystem for Data Discovery and Reuse (2020)
    Online
  • CORD-19 Search: Using Machine Learning to Explore COVID-19 Scientific Literature (2020)
    Online
  • CORD-19: the COVID-19 open research dataset (2020)
    Online
  • CORD-19: The COVID-19 Open Research Dataset (2020)
    Online
  • CORD-19: The COVID-19 Open Research Dataset. Global Tech Mining Conference (2020)
    Online
  • Fact or fiction: verifying scientific claims (2020)
    Online
  • Improving Access to Scientific Literature for NLP (2020)
    Online
  • MedICaT: a dataset of medical images, captions, and textual references (2020)
    Online
  • Mining the COVID-19 Scientific Literature with the CORD-19 Open Research Dataset (2020)
    Online
  • Open Publishing and Open Data (2020)
    Online
  • Rapid Fire Session: Showcasing What is Here! (2020)
    Online
  • S2ORC: the Semantic Scholar open research corpus (2020)
    Online
  • SUPP.AI: finding evidence for supplement-drug interactions (2020)
    Online
  • The COVID-19 Open Research Dataset (2020)
    Online
  • The COVID-19 Open Research Dataset (2020)
    Online
  • The COVID-19 Open Research Dataset (2020)
    Online
  • The Role of Scientific NLP During an Epidemic (2020)
    Online
  • TREC-COVID: information retrieval for supporting COVID-19 research (2020)
    Online
  • Automated Identification of Noise Signal in Spinal DCE-MRI using Independent Component Analysis and Unsupervised Machine Learning (2019)
    Montréal, QC, Canada
  • Extracting evidence of supplement-drug interactions from literature (2019)
    Vancouver, BC, Canada
  • Ontology-based Integration of Biological Pathway Data (2019)
    Amherst, MA, USA
  • A Brief Introduction to Ontology (2018)
    Seattle, WA, USA
  • A SPARQL Tutorial (2018)
    Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
  • Learning from Biomedical Knowledge (2018)
    Seattle, WA, USA
  • Ontologies and Algorithms for Integrating Biological Pathway Data (2018)
    Seattle, WA, USA
  • Ontology alignment in the biomedical domain using entity definitions and context (2018)
    Melbourne, Australia
  • Ontology-based integration of pathway databases using Pathway Ontology annotations (2018)
    Chicago, IL, USA
  • Quantifying the effects of gene entity disambiguation for GSEA (2018)
    San Francisco, CA, USA
  • Semi-automated integration of pathway data for pathway analysis (2018)
    San Francisco, CA, USA
  • Detection and Functional Classification of Fusion Genes Using Pathway Expression Profiles (2017)
    San Francisco, CA, USA
  • Similarity metrics for determining overlap among biological pathways (2017)
    Newcastle upon Tyne, United Kingdom
  • An analysis of differences in biological pathway resources (2016)
    Corvallis, OR, USA
  • Auditing tree-like organ systems in the FMA using network motifs (2016)
    Chicago, IL, USA
  • Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination (2016)
    Chicago, IL, USA
  • Discovering representational differences between pathway knowledge bases for pathway resource merging (2016)
    Chicago, IL, USA
  • Identifying and resolving inconsistencies in biological pathway resources (2016)
    Columbus, OH, USA
  • Biological model development as an opportunity to provide content auditing for the foundational model of anatomy ontology (2015)
    San Francisco, CA, USA
  • Development of a discharge ontology to support postanesthesia discharge decision making (2015)
    Lisbon, Portugal
  • Ontological content auditing during model creation using the foundational model of anatomy (2015)
    Bethesda, MD, USA
  • Detrended fluctuation analysis of peak expiratory flow and its association with destabilization of asthma control (2014)
    San Diego, CA, USA
  • Electrical impedance myography in DMD: a multi-center study of reliability and relationships to strength and function (2013)
    Asilomar, CA, USA