Affiliate Position
- Adjunct Assistant Professor, UW Computer Science & Engineering
Specializations
- Natural Language Processing
- Health Informatics
- Machine Learning for Health
Research Areas
Courses
- INFO 330 - Databases And Data Modeling
Biography
Lucy Lu Wang is an Assistant Professor at the University of Washington Information School. Her research focuses on how to build better AI and NLP systems for extracting and understanding information from scientific texts; for example, can we create systems that leverage up-to-date literature to help us make better and more data-driven healthcare decisions, or design document understanding models that can improve the readability of scientific texts for people who are blind and low vision. Lucy’s work on supplement interaction detection, gender trends in academic publishing, COVID-19 datasets, and document understanding has been featured in Geekwire, Boing Boing, Axios, VentureBeat, and the New York Times. Prior to joining the UW, she was a Young Investigator at the Allen Institute for AI, and she received her PhD in Biomedical Informatics and Medical Education from the University of Washington.
Education
- Ph D, Biomedical Informatics and Medical Education, University of Washington, 2019
- MS, Applied Biomedical Engineering, The Johns Hopkins University, 2013
- BS, Physics, Massachusetts Institute of Technology, 2009
Publications and Contributions
-
Conference PaperA Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers (2022)ASSETS 2022
-
Journal Article, Academic JournalAutomatic Question Answering for Multiple Stakeholders: the Epidemic Question Answering Dataset (2022)Nature Scientific Data
-
Conference PaperGenerating scientific claims for automated scientific fact checking (2022)ACL 2022
-
Journal Article, Academic JournalInfrastructure for rapid open knowledge network development (2022)AI Magazine
-
Conference PaperLiterature-Augmented Clinical Outcome Prediction (2022)NAACL Findings 2022
-
Conference PaperMultiVerS: Improving scientific claim verification with weak supervision and full-document context (2022)NAACL Findings 2022
-
PreprintPaper Plain: Making medical research papers approachable to healthcare consumers with natural language processing (2022)
-
Newsletter
-
Journal Article, Academic JournalVILA: Improving structured content extraction from scientific PDF using visual layout groups (2022)Transactions of the ACL
-
Conference Extended AbstractA bibliometric analysis of citation diversity in accessibility and HCI research (2021)CHI Extended Abstracts 2021
-
Journal Article, Academic JournalGender trends in computer science authorship (2021)Communications of the ACM
-
Journal Article, Academic JournalHarnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World (2021)IEEE Internet of Things
-
PreprintImproving the accessibility of scientific documents: current state, user needs, and a system solution to enhance scientific PDF accessibility for blind and low vision users (2021)
-
Conference PaperMulti-document summarization of medical studies (2021)EMNLP 2021
-
Conference PaperOverview of the Second Workshop on Scholarly Document Processing at NAACL 2021 (2021)SDP at NAACL 2021
-
Conference PaperSciA11y: Converting scientific papers to accessible HTML (2021)ASSETS 2021
-
Journal Article, Academic JournalSearching for scientific evidence in a pandemic: an overview of TREC-COVID (2021)Journal of Biomedical Informatics
-
Conference PaperWhat do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019 (2021)CHI 2021
-
Conference Workshop PaperCORD-19: the COVID-19 open research dataset (2020)NLP-COVID at ACL 2020
-
Conference PaperFact or fiction: verifying scientific claims (2020)EMNLP 2020
-
Conference PaperMedICaT: a dataset of medical images, captions, and textual references (2020)EMNLP Findings 2020
-
Journal Article, Academic JournalMitigating biases in CORD-19 for analyzing COVID-19 literature (2020)Frontiers in Research Metrics and Analytics
-
Journal Article, Academic JournalModelling kidney disease using ontology: insights from the Kidney Precision Medicine Project (2020)Nature Reviews Nephrology
-
Conference PaperOverview of the 2020 Epidemic Question Answering Track (2020)TAC 2020
-
Conference PaperS2ORC: the Semantic Scholar open research corpus (2020)ACL 2020
-
Conference PaperSUPP.AI: finding evidence for supplement-drug interactions (2020)ACL Demo 2020
-
Conference PaperTREC-COVID: Constructing a Pandemic Information Retrieval Test Collection (2020)SIGIR Forum
-
Journal Article, Academic JournalTREC-COVID: rationale and structure of an information retrieval shared task for COVID-19 (2020)Journal of the American Medical Informatics Association
-
Journal Article, Academic JournalText mining approaches for dealing with the rapidly expanding literature on COVID-19 (2020)Briefings in Bioinformatics
-
Conference Extended AbstractExtracting evidence of supplement-drug interactions from literature (2019)ML4H at NeurIPS 2019
-
PreprintPhenotypeXpression: sub-classification of disease states using public gene expression data and literature (2019)
-
Journal Article, Academic JournalPredicting instances of Pathway Ontology classes for pathway integration (2019)Journal of Biomedical Semantics
-
Conference PaperConstruction of the literature graph in Semantic Scholar (2018)NAACL Industry 2018
-
Conference Workshop PaperOntology alignment in the biomedical domain using entity definitions and context (2018)BioNLP at ACL 2018
-
PreprintPhenotypeXpression: sub-classification of disease states using public gene expression data and literature (2018)
-
Journal Article, Academic JournalFluctuation analysis of peak expiratory flow and its associations with treatment failure in asthma (2017)American Journal of Respiratory and Critical Care Medicine
-
Conference PaperSimilarity metrics for determining overlap among biological pathways (2017)ICBO 2017
-
Conference PaperAn analysis of differences in biological pathway resources (2016)ICBO and BioCreative 2016
-
Conference PaperDevelopment of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination (2016)AMIA 2016
-
Conference PaperBiological model development as an opportunity to provide content auditing for the Foundational Model of Anatomy ontology (2015)AMIA 2015
-
Journal Article, Academic JournalElectrical impedance myography in Duchenne muscular dystrophy and health controls: a multi-center study of reliability and validity (2015)Muscle & Nerve
-
Journal Article, Academic JournalAssessment of alterations in the electrical impedance of muscle after experimental nerve injury via finite-element analysis (2011)IEEE Transactions on Biomedical Engineering
-
Journal Article, Academic JournalElectrical impedance myography for monitoring motor neuron loss in the SOD1 G93A amyotrophic lateral sclerosis rat (2011)Clinical Neurophysiology
Presentations
-
AI and Scholarly Publishing
(2022)
Online
-
AI: Helping Us Learning to Relax and Love PDFs
(2022)
San Francisco, CA, USA
-
Generating scientific claims for automated scientific fact checking
(2022)
Dublin, Ireland
-
Identifying and Mitigating Algorithmic Biases
(2022)
School of Law, Seattle University, Seattle, WA, USA
-
Knowledge Representation and Semantics for Biomedical Knowledge Synthesis
(2022)
Online
-
Literature-Augmented Clinical Outcome Prediction
(2022)
Seattle, WA, USA
-
MultiVerS: Improving scientific claim verification with weak supervision and full-document context
(2022)
Seattle, WA, USA
-
Ontology and NLP: Bridging the ‘Structural Chasm
(2022)
Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
-
The Machine Element: Signals and Noise: How AI and ML Techniques are Being Deployed to Track a Global Pandemic
(2022)
Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Charlottesville, VA, USA
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Online
-
Unlocking Biomedical Knowledge: NLP Systems for Synthesizing Biomedical Evidence
(2022)
Online
-
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups
(2022)
Dublin, Ireland
-
A bibliometric analysis of citation diversity in accessibility and HCI research
(2021)
Online
-
Biomedical Informatics Career Development
(2021)
Online
-
Fast-track Learning: Growing Insights from Text-mining COVID-19 Data
(2021)
Online
-
Mathematics in the Scholarly Literature
(2021)
Aussois, France and Online
-
MS^2: Multi-document summarization of medical studies
(2021)
Punta Cana, Dominican Republic
-
NLP and Text Mining Resources for COVID-19 and Beyond
(2021)
Online
-
Practical NLP for Biomedicine: Synthesizing Knowledge from Scientific Literature
(2021)
Online
-
Practical NLP for scientific text mining: extracting and synthesizing knowledge from the literature
(2021)
Online
-
SciA11y: Converting scientific papers to accessible HTML
(2021)
Online
-
Text Mining Insights from the COVID-19 Pandemic
(2021)
Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Online
-
Using Machine Learning to Verify Scientific Claims
(2021)
Online
-
What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019
(2021)
Online
-
A SPARQL Tutorial
(2020)
Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA.
-
Building Community and Data Ecosystem for Data Discovery and Reuse
(2020)
Online
-
CORD-19 Search: Using Machine Learning to Explore COVID-19 Scientific Literature
(2020)
Online
-
CORD-19: the COVID-19 open research datase
(2020)
Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
Online
-
CORD-19: The COVID-19 Open Research Dataset. Global Tech Mining Conference
(2020)
Online
-
Fact or fiction: verifying scientific claims
(2020)
Online
-
Improving Access to Scientific Literature for NLP
(2020)
Online
-
MedICaT: a dataset of medical images, captions, and textual references
(2020)
Online
-
Mining the COVID-19 Scientific Literature with the CORD-19 Open Research Dataset.
(2020)
Online
-
Open Publishing and Open Data
(2020)
Online
-
Rapid Fire Session: Showcasing What is Here!
(2020)
Online
-
S2ORC: the Semantic Scholar open research corpus
(2020)
Online
-
SUPP.AI: finding evidence for supplement-drug interactions
(2020)
Online
-
The COVID-19 Open Research Dataset
(2020)
Online
-
The COVID-19 Open Research Dataset
(2020)
Online
-
The COVID-19 Open Research Dataset
(2020)
Online
-
The Role of Scientific NLP During an Epidemi
(2020)
Online
-
TREC-COVID: information retrieval for supporting COVID-19 research
(2020)
Online
-
Automated Identification of Noise Signal in Spinal DCE-MRI using Independent Component Analysis and Unsupervised Machine Learning
(2019)
Montréal, QC, Canada
-
Extracting evidence of supplement-drug interactions from literature
(2019)
Vancouver, BC, Canada
-
Ontology-based Integration of Biological Pathway Data
(2019)
Amherst, MA, USA
-
A Brief Introduction to Ontology
(2018)
Seattle, WA, USA
-
A SPARQL Tutorial
(2018)
Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
-
Learning from Biomedical Knowledge
(2018)
Seattle, WA, USA
-
Ontologies and Algorithms for Integrating Biological Pathway Data
(2018)
Seattle, WA, USA
-
Ontology alignment in the biomedical domain using entity definitions and context
(2018)
Melbourne, Australia
-
Ontology- based integration of pathway databases using Pathway Ontology annotations
(2018)
Chicago, IL, USA
-
Quantifying the effects of gene entity disambiguation for GSEA
(2018)
San Francisco, CA, USA
-
Semi-automated integration of pathway data for pathway analysis
(2018)
San Francisco, CA, USA
-
Detection and Functional Classification of Fusion Genes Using Pathway Expression Profiles
(2017)
San Francisco, CA, USA
-
Similarity metrics for determining overlap among biological pathways
(2017)
Newcastle upon Tyne, United Kingdom
-
An analysis of differences in biological pathway resources
(2016)
Corvallis, OR, USA
-
Auditing tree-like organ systems in the FMA using network motifs
(2016)
Chicago, IL, USA
-
Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination
(2016)
Chicago, IL, USA
-
Discovering representational differences between pathway knowledge bases for pathway resource merging
(2016)
Chicago, IL, USA
-
Identifying and resolving inconsistencies in biological pathway resources
(2016)
Columbus, OH, USA
-
Biological model development as an opportunity to provide content auditing for the foundational model of anatomy ontology
(2015)
San Francisco, CA, USA
-
Development of a discharge ontology to support postanesthesia discharge decision making
(2015)
Lisbon, Portugal
-
Ontological content auditing during model creation using the foundational model of anatomy
(2015)
Bethesda, MD, USA
-
Detrended fluctuation analysis of peak expiratory flow and its association with destabilization of asthma control
(2014)
San Diego, CA, USA
-
Electrical impedance myography in DMD: a multi-center study of reliability and relationships to strength and function
(2013)
Asilomar, CA, USA
-
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers
The 24th International ACM SIGACCESS Conference on Computers and Accessibility - Athens, Greece
-
How AI can make PDF useful again
PageBreak - San Francisco, CA