Affiliate Positions
- Adjunct Assistant Professor, UW Human Centered Design & Engineering
- Adjunct Assistant Professor, UW Biomedical Informatics & Medical Education
- Adjunct Assistant Professor, UW Computer Science & Engineering
Specializations
- Natural Language Processing
- Health Informatics
- Machine Learning for Health
Biography
Lucy Lu Wang is an Assistant Professor at the University of Washington Information School. Her research focuses on building better AI and NLP systems for extracting and understanding information from scientific texts. For example, can we create systems that leverage up-to-date literature to help us make better, more data-driven healthcare decisions, or design document understanding models that improve the readability of scientific texts for people who are blind or have low vision? Lucy’s work on supplement interaction detection, gender trends in academic publishing, COVID-19 datasets, and document understanding has been featured in GeekWire, Boing Boing, Axios, VentureBeat, and the New York Times. Prior to joining the UW, she was a Young Investigator at the Allen Institute for AI. She received her PhD in Biomedical Informatics and Medical Education from the University of Washington.
Education
- PhD, Biomedical Informatics and Medical Education, University of Washington, 2019
- MS, Applied Biomedical Engineering, The Johns Hopkins University, 2013
- BS, Physics, Massachusetts Institute of Technology, 2009
Awards
- Institute for Medical Data Science Pilot Award - Institute for Medical Data Science, 2023
Publications and Contributions
-
Conference Paper
FigurA11y: AI Assistance for Writing Scientific Alt Text
(2024)
ACM IUI 2024, pp. 886–906
-
Conference Paper
From Paper to Card: Transforming Design Implications with Generative AI
(2024)
ACM CHI 2024, pp. 1–15
-
Invited Essay
Generative AI for Scholarly Information Access
(2024)
Against the Grain, 36(3)
-
Book Editor, Scholarly
Guest editorial: Semantics-enabled Biomedical Literature Analytics
(2024)
Journal of Biomedical Informatics, 150
-
Conference Paper
NLP for Maternal Healthcare: Perspectives and Guiding Principles in the Age of LLMs
(2024)
ACM FAccT 2024, pp. 1446–1463
-
Conference Paper
Personalized Jargon Identification for Enhanced Interdisciplinary Communication
(2024)
NAACL 2024 (Volume 1: Long Papers), pp. 4535–4550
-
Conference Paper
TOPICAL: TOPIC Pages AutomagicaLly
(2024)
NAACL 2024 (Volume 3: System Demonstrations), pp. 1–11
-
Conference Paper
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
(2023)
ACL 2023, pp. 9871–9889
-
Magazine/Trade Publication
Foundations of Responsible NLP Use for Maternal Health Equity
(2023)
AAMC Center For Health Justice
-
Conference Extended Abstract
Measuring the Prevalence and Downstream Impact of Data and Method Sharing in arXiv Preprints
(2023)
2nd Annual International Conference on the Science of Science and Innovation (ICSSI 2023)
-
Conference Paper
Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval
(2023)
EMNLP Findings 2023, pp. 8177–8199
-
Journal Article, Academic Journal
Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing
(2023)
ACM Transactions on Computer-Human Interaction (TOCHI), 30(5), pp. 1–38
-
Preprint
-
Book, Chapter in Non-Scholarly Book-New
Using Machine Learning to Verify Scientific Claims
(2023)
Artificial Intelligence in Science: Challenges, Opportunities and the Future of Research, pp. 121–128
-
Overview of MSLR2022: A shared task on multi-document summarization for literature reviews
(2022)
SDP at COLING 2022
-
Conference Paper
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers
(2022)
ASSETS 2022
-
Journal Article, Academic Journal
Automatic Question Answering for Multiple Stakeholders: the Epidemic Question Answering Dataset
(2022)
Scientific Data, 9(1), pp. 1–11
-
Conference Paper
Generating Scientific Claims for Zero-Shot Scientific Fact Checking
(2022)
ACL 2022 (Volume 1: Long Papers), pp. 2448–2460
-
Journal Article, Academic Journal
Infrastructure for rapid open knowledge network development
(2022)
AI Magazine, 43(1), pp. 59–68
-
Conference Paper
Literature-Augmented Clinical Outcome Prediction
(2022)
NAACL Findings 2022, pp. 438–453
-
Conference Extended Abstract
Literature-Augmented Clinical Outcome Prediction
(2022)
Machine Learning for Health (ML4H) at NeurIPS 2022
-
Conference Paper
MultiVerS: Improving scientific claim verification with weak supervision and full-document context
(2022)
NAACL Findings 2022, pp. 61–76
-
Journal Article, Academic Journal
Paper to HTML: A publicly available web tool for converting scientific PDFs into accessible HTML
(2022)
ACM SIGACCESS Accessibility and Computing, Issue 134
-
Conference Paper
SciFact-Open: Towards open-domain scientific claim verification
(2022)
EMNLP Findings 2022
-
Journal Article, Academic Journal
VILA: Improving structured content extraction from scientific PDF using visual layout groups
(2022)
Transactions of the ACL
-
Conference Extended Abstract
A bibliometric analysis of citation diversity in accessibility and HCI research
(2021)
CHI Extended Abstracts 2021
-
Journal Article, Academic Journal
Gender trends in computer science authorship
(2021)
Communications of the ACM
-
Journal Article, Academic Journal
Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World
(2021)
IEEE Internet of Things
-
Preprint
Improving the accessibility of scientific documents: current state, user needs, and a system solution to enhance scientific PDF accessibility for blind and low vision users
(2021)
-
Conference Paper
MS^2: A Dataset for Multi-Document Summarization of Medical Studies
(2021)
EMNLP 2021, pp. 7494–7513
-
Conference Paper
SciA11y: Converting scientific papers to accessible HTML
(2021)
ASSETS 2021
-
Journal Article, Academic Journal
Searching for scientific evidence in a pandemic: an overview of TREC-COVID
(2021)
Journal of Biomedical Informatics
-
Conference Paper
What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019
(2021)
CHI 2021
-
Conference Workshop Paper
CORD-19: the COVID-19 open research dataset
(2020)
NLP-COVID at ACL 2020
-
Conference Paper
Fact or fiction: verifying scientific claims
(2020)
EMNLP 2020
-
Conference Paper
MedICaT: a dataset of medical images, captions, and textual references
(2020)
EMNLP Findings 2020
-
Journal Article, Academic Journal
Mitigating biases in CORD-19 for analyzing COVID-19 literature
(2020)
Frontiers in Research Metrics and Analytics
-
Journal Article, Academic Journal
Modelling kidney disease using ontology: insights from the Kidney Precision Medicine Project
(2020)
Nature Reviews Nephrology
-
Conference Paper
Overview of the 2020 Epidemic Question Answering Track
(2020)
TAC 2020
-
Conference Paper
S2ORC: the Semantic Scholar open research corpus
(2020)
ACL 2020
-
Conference Paper
SUPP.AI: finding evidence for supplement-drug interactions
(2020)
ACL Demo 2020
-
Conference Paper
TREC-COVID: Constructing a Pandemic Information Retrieval Test Collection
(2020)
SIGIR Forum
-
Journal Article, Academic Journal
TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19
(2020)
Journal of the American Medical Informatics Association
-
Journal Article, Academic Journal
Text mining approaches for dealing with the rapidly expanding literature on COVID-19
(2020)
Briefings in Bioinformatics
-
Thesis
Ontology-driven pathway data integration
(2019)
Department of Biomedical Informatics and Medical Education, University of Washington
-
Conference Extended Abstract
Extracting evidence of supplement-drug interactions from literature
(2019)
ML4H at NeurIPS 2019
-
Journal Article, Academic Journal
Predicting instances of Pathway Ontology classes for pathway integration
(2019)
Journal of Biomedical Semantics
-
Conference Paper
Construction of the literature graph in Semantic Scholar
(2018)
NAACL Industry 2018
-
Conference Workshop Paper
Ontology alignment in the biomedical domain using entity definitions and context
(2018)
BioNLP at ACL 2018
-
Preprint
PhenotypeXpression: sub-classification of disease states using public gene expression data and literature
(2018)
-
Journal Article, Academic Journal
Fluctuation analysis of peak expiratory flow and its associations with treatment failure in asthma
(2017)
American Journal of Respiratory and Critical Care Medicine
-
Conference Paper
Similarity metrics for determining overlap among biological pathways
(2017)
ICBO 2017
-
Conference Paper
An analysis of differences in biological pathway resources
(2016)
ICBO and BioCreative 2016
-
Conference Paper
Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination
(2016)
AMIA 2016
-
Conference Paper
Biological model development as an opportunity to provide content auditing for the Foundational Model of Anatomy ontology
(2015)
AMIA 2015
-
Journal Article, Academic Journal
Electrical impedance myography in Duchenne muscular dystrophy and healthy controls: a multi-center study of reliability and validity
(2015)
Muscle & Nerve
-
Masters Thesis
Matching Pursuit for Detecting Epileptic Response in EEG Following Photic Stimulation
(2013)
Department of Biomedical Engineering, The Johns Hopkins University
-
Journal Article, Academic Journal
Assessment of alterations in the electrical impedance of muscle after experimental nerve injury via finite-element analysis
(2011)
IEEE Transactions on Biomedical Engineering
-
Journal Article, Academic Journal
Electrical impedance myography for monitoring motor neuron loss in the SOD1 G93A amyotrophic lateral sclerosis rat
(2011)
Clinical Neurophysiology
Presentations
-
AI-assisted health information extraction and summarization
(2024)
American Society of Pediatric Hematology/Oncology (ASPHO) Informatics, Innovation & Entrepreneurship-Special Interest Group (IIE-SIG) - Seattle, WA, USA
-
AI-powered systems for scholarly search and content production
(2024)
AI Week for Researchers, Singapore Management University - Online
-
Challenges and Opportunities in Translational Science
(2024)
Semantic Scholar Research, AI2 - Online
-
FigurA11y: AI Assistance for Writing Scientific Alt Text
(2024)
ACM Conference on Intelligent User Interfaces (IUI 2024) - Greenville, South Carolina, USA
-
Personalized Jargon Identification for Enhanced Interdisciplinary Communication
(2024)
NAACL - Online
-
Roundtable Discussion: The impact of the increasing use of AI on the research workflow - in particular, its effect on research quality, research evaluation and our skills
(2024)
AI Week for Researchers, Singapore Management University - Online
-
TOPICAL: TOPIC Pages AutomagicaLly
(2024)
NAACL System Demonstrations - Mexico City, Mexico
-
6 Years of FEVER Workshops - How Far Have We Come?
(2023)
The Sixth Workshop on Fact Extraction and Verification (FEVER) at EACL - Dubrovnik, Croatia
-
AI in Scholarly Communications: Where We Are and Where We’re Going
(2023)
FORCE11 Scholarly Communication Institute (FSCI) Conference - Online
-
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
(2023)
Association for Computational Linguistics (ACL’23) - Toronto, Canada
-
Biomedical Evidence Extraction and Synthesis
(2023)
The Center for Informatics Research in Science and Scholarship (CIRSS) Seminar Series, School of Information Sciences, UIUC - Urbana-Champaign, IL
-
Can Scientific Claim Verification Help Us Do Better Science?
(2023)
The Sixth Workshop on Fact Extraction and Verification (FEVER) at EACL - Dubrovnik, Croatia
-
Generative AI for Translational Scholarly Communication
(2023)
SAUL-RSTF Webinar, National University of Singapore - Online
-
Generative AI for Translational Scholarly Communication
(2023)
Hong Kong University of Science and Technology Library - Online
-
Improving the Accessibility of Scholarly Communication
(2023)
Universidade Estadual de Campinas (Unicamp) Computer Science Seminar - Online
-
Incorporating External Knowledge for Clinical Outcome Prediction
(2023)
Institute for Medical Data Science Seminar - Seattle, WA
-
Measuring the prevalence and downstream impact of data and method sharing in arXiv preprints
(2023)
International Conference on the Science of Science and Innovation (ICSSI 2023) - Evanston, Illinois
-
Open domain multi-document summarization: A comprehensive study of model brittleness under retrieval
(2023)
2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) - Sentosa, Singapore
-
Taking and Giving Back: Open Access, Generative AI, and the Transformation of Scholarly Communication
(2023)
OA Week, Indiana University Bloomington - Bloomington, Indiana
-
AI and Scholarly Publishing
(2022)
Society for Scholarly Publishing ‘Ask the Experts’ Webinar - Online
-
Generating Scientific Claims for Zero-Shot Scientific Fact Checking
(2022)
ACL 2022 - Dublin, Ireland
-
How AI can make PDF useful again
(2022)
PageBreak - San Francisco, CA
-
Identifying and Mitigating Algorithmic Biases
(2022)
School of Law, Seattle University - Seattle, WA
-
Knowledge Representation and Semantics for Biomedical Knowledge Synthesis
(2022)
SeBiLAn Workshop at TheWebConf (WWW) - Online
-
Literature-Augmented Clinical Outcome Prediction
(2022)
NAACL - Seattle, WA, USA
-
MultiVerS: Improving scientific claim verification with weak supervision and full-document context
(2022)
NAACL - Seattle, WA, USA
-
Ontology and NLP: Bridging the ‘Structural Chasm’
(2022)
Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
The Machine Element: Signals and Noise: How AI and ML Techniques are Being Deployed to Track a Global Pandemic
(2022)
Friends of the NLM Virtual Workshop - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
School of Data Science, University of Virginia - Charlottesville, VA, USA
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Department of Informatics, Luddy School of Informatics, Indiana University-Bloomington - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Information School, University of Washington - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Synthesizing Biomedical Evidence
(2022)
Computer Science Research Seminar, Emory University - Online
-
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups
(2022)
ACL - Dublin, Ireland
-
A bibliometric analysis of citation diversity in accessibility and HCI research
(2021)
CHI - Online
-
Fast-track Learning: Growing Insights from Text-mining COVID-19 Data
(2021)
1st GTM2021 Virtual Forum - Online
-
Mathematics in the Scholarly Literature
(2021)
Conference on Artificial Intelligence and Theorem Proving (AITP) - Aussois, France and Online
-
MS^2: Multi-document summarization of medical studies
(2021)
EMNLP - Punta Cana, Dominican Republic
-
NLP and Text Mining Resources for COVID-19 and Beyond
(2021)
Machine Learning for Preventing and Combating Pandemics Workshop at ICLR 2021 - Online
-
Practical NLP for Biomedicine: Synthesizing Knowledge from Scientific Literature
(2021)
CS Colloquium, Northwestern University - Online
-
Practical NLP for scientific text mining: extracting and synthesizing knowledge from the literature
(2021)
Science of Science Summer School (S4) - Online
-
SciA11y: Converting scientific papers to accessible HTML
(2021)
ASSETS - Online
-
Text Mining Insights from the COVID-19 Pandemic
(2021)
Bibliometric-enhanced Information Retrieval (BIR) Workshop at ECIR 2021 - Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Legalweek - Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Relativity Media Pandemic short film discussion panel - Online
-
Using Machine Learning to Verify Scientific Claims
(2021)
OECD Workshop on AI and the Productivity of Science - Online
-
What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019
(2021)
CHI - Online
-
Building Community and Data Ecosystem for Data Discovery and Reuse
(2020)
Artificial Intelligence for Data Discovery and Reuse (AIDR) Symposium - Online
-
CORD-19 Search: Using Machine Learning to Explore COVID-19 Scientific Literature
(2020)
AWS Education: Research Seminar Series - Online
-
CORD-19: the COVID-19 open research dataset
(2020)
NLP-COVID Workshop at ACL - Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
NLP Meetup (NY-NLP, A2D-NLP, DC-NLP, Hungarian NLP, London Text Analytics) - Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
Global Tech Mining Conference - Online
-
Fact or fiction: verifying scientific claims
(2020)
EMNLP - Online
-
Improving Access to Scientific Literature for NLP
(2020)
Microsoft Research Hanover Group - Online
-
MedICaT: a dataset of medical images, captions, and textual references
(2020)
SDP Workshop at EMNLP - Online
-
Mining the COVID-19 Scientific Literature with the CORD-19 Open Research Dataset
(2020)
Artificial Intelligence for Data Discovery and Reuse (AIDR) Symposium - Online
-
Open Publishing and Open Data
(2020)
Neuro-Gairdner Open Science in Action Symposium - Online
-
Rapid Fire Session: Showcasing What is Here!
(2020)
Gastroenterology and Artificial Intelligence: 2nd Annual Artificial Intelligence Summit - Online
-
S2ORC: the Semantic Scholar open research corpus
(2020)
ACL - Online
-
SUPP.AI: finding evidence for supplement-drug interactions
(2020)
ACL Demo - Online
-
The COVID-19 Open Research Dataset
(2020)
Connected Health and COVID-19: Now and Beyond the Great Lockdown - Online
-
The COVID-19 Open Research Dataset
(2020)
Centre for Science and Technology Studies, Leiden University - Online
-
The COVID-19 Open Research Dataset
(2020)
Semantic Indexing and Information Retrieval for Health (SIIRH) Workshop at ECIR - Online
-
The Role of Scientific NLP During an Epidemic
(2020)
1st SciNLP Workshop on Natural Language Processing and Data Mining for Scientific Text - Online
-
TREC-COVID: information retrieval for supporting COVID-19 research
(2020)
AMIA Natural Language Processing Working Group Pre-Symposium - Online
-
Automated Identification of Noise Signal in Spinal DCE-MRI using Independent Component Analysis and Unsupervised Machine Learning
(2019)
ISMRM - Montréal, QC, Canada
-
Extracting evidence of supplement-drug interactions from literature
(2019)
ML4H Workshop at NeurIPS - Vancouver, BC, Canada
-
Ontology-based Integration of Biological Pathway Data
(2019)
Scientific Literature Knowledge Bases Workshop at Automated Knowledge Base Construction (AKBC) - Amherst, MA, USA
-
A Brief Introduction to Ontology
(2018)
Kidney Precision Medicine Project Ontology Webinar - Seattle, WA, USA
-
A SPARQL Tutorial
(2018)
Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
Learning from Biomedical Knowledge
(2018)
The Allen Institute for Artificial Intelligence (AI2) - Seattle, WA, USA
-
Ontologies and Algorithms for Integrating Biological Pathway Data
(2018)
BIME 590 Seminar, Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
Ontology alignment in the biomedical domain using entity definitions and context
(2018)
Bio-NLP Workshop at ACL - Melbourne, Australia
-
Ontology-based integration of pathway databases using Pathway Ontology annotations
(2018)
Bio-Ontologies at ISMB - Chicago, IL, USA
-
Quantifying the effects of gene entity disambiguation for GSEA
(2018)
AMIA Symposium - San Francisco, CA, USA
-
Semi-automated integration of pathway data for pathway analysis
(2018)
Knowledge Representation and Semantics Working Group Pre-Symposium Doctoral Consortium, AMIA Symposium - San Francisco, CA, USA
-
Detection and Functional Classification of Fusion Genes Using Pathway Expression Profiles
(2017)
AMIA Joint Summits on Translational Science - San Francisco, CA, USA
-
Similarity metrics for determining overlap among biological pathways
(2017)
ICBO - Newcastle upon Tyne, United Kingdom
-
An analysis of differences in biological pathway resources
(2016)
ICBO & BioCreative - Corvallis, OR, USA
-
Auditing tree-like organ systems in the FMA using network motifs
(2016)
AMIA Symposium - Chicago, IL, USA
-
Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination
(2016)
AMIA 2016 - Chicago, IL, USA
-
Discovering representational differences between pathway knowledge bases for pathway resource merging
(2016)
AMIA Symposium - Chicago, IL, USA
-
Identifying and resolving inconsistencies in biological pathway resources
(2016)
NLM Informatics Training Conference - Columbus, OH, USA
-
Biological model development as an opportunity to provide content auditing for the foundational model of anatomy ontology
(2015)
AMIA Symposium - San Francisco, CA, USA
-
Development of a discharge ontology to support postanesthesia discharge decision making
(2015)
ICBO - Lisbon, Portugal
-
Ontological content auditing during model creation using the foundational model of anatomy
(2015)
NLM Informatics Training Conference - Bethesda, MD, USA
-
Detrended fluctuation analysis of peak expiratory flow and its association with destabilization of asthma control
(2014)
International Conference of the American Thoracic Society (ATS) - San Diego, CA, USA
-
Electrical impedance myography in DMD: a multi-center study of reliability and relationships to strength and function
(2013)
The 18th International Congress of the World Muscle Society - Asilomar, CA, USA