Affiliate Position
- Adjunct Assistant Professor, UW Computer Science & Engineering
Specializations
- Natural Language Processing
- Health Informatics
- Machine Learning for Health
Research Areas
Courses
- INFO 498 - Special Topics In Informatics
Biography
Lucy Lu Wang is an Assistant Professor at the University of Washington Information School. Her research focuses on how to build better AI and NLP systems for extracting and understanding information from scientific texts; for example, can we create systems that leverage up-to-date literature to help us make better and more data-driven healthcare decisions, or design document understanding models that can improve the readability of scientific texts for people who are blind and low vision. Lucy’s work on supplement interaction detection, gender trends in academic publishing, COVID-19 datasets, and document understanding has been featured in Geekwire, Boing Boing, Axios, VentureBeat, and the New York Times. Prior to joining the UW, she was a Young Investigator at the Allen Institute for AI, and she received her PhD in Biomedical Informatics and Medical Education from the University of Washington.
Education
- Ph D, Biomedical Informatics and Medical Education, University of Washington, 2019
- MS, Applied Biomedical Engineering, The Johns Hopkins University, 2013
- BS, Physics, Massachusetts Institute of Technology, 2009
Publications and Contributions
-
PreprintAPPLS: A Meta-evaluation Testbed for Plain Language Summarization (2023)arXiv:2305.14341
-
Conference PaperAutomated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations (2023)ACL 2023
-
Journal Article, Professional JournalPaper Plain: Making medical research papers approachable to healthcare consumers with natural language processing (2023)Transactions of CHI (TOCHI)
-
Preprint
-
Book, Chapter in Non-Scholarly Book-NewUsing Machine Learning to Verify Scientific Claims (2023)AI and the Productivity of Science
-
Overview of MSLR2022: A shared task on multi- document summarization for literature reviews (2022)SDP at COLING 2022
-
Conference PaperA Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers (2022)ASSETS 2022
-
Journal Article, Academic JournalAutomatic Question Answering for Multiple Stakeholders: the Epidemic Question Answering Dataset (2022)Nature Scientific Data
-
Conference ProceedingGenerating Scientific Claims for Zero-Shot Scientific Fact Checking (2022)ACL 2022
-
Conference PaperGenerating scientific claims for automated scientific fact checking (2022)ACL 2022
-
Journal Article, Academic JournalInfrastructure for rapid open knowledge network development (2022)AI Magazine
-
Conference Extended AbstractLiterature-Augmented Clinical Outcome Prediction (2022)Machine Learning for Health (ML4H) at NeurIPS 2022
-
Conference PaperLiterature-Augmented Clinical Outcome Prediction (2022)NAACL Findings 2022
-
Conference PaperMultiVerS: Improving scientific claim verification with weak supervision and full-document context (2022)NAACL Findings 2022
-
Newsletter
-
Conference PaperSciFact-Open: Towards open-domain scientific claim verification (2022)Association for Computational Linguistics, Findings of the Association for Computational Linguistics: EMNLP 2022
-
Journal Article, Academic JournalVILA: Improving structured content extraction from scientific PDF using visual layout groups (2022)Transactions of the ACL
-
Conference Extended AbstractA bibliometric analysis of citation diversity in accessibility and HCI research (2021)CHI Extended Abstracts 2021
-
Journal Article, Academic JournalGender trends in computer science authorship (2021)Communications of the ACM
-
Journal Article, Academic JournalHarnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World (2021)IEEE Internet of Things
-
PreprintImproving the accessibility of scientific documents: current state, user needs, and a system solution to enhance scientific PDF accessibility for blind and low vision users (2021)
-
Conference PaperMSˆ2: A Dataset for Multi-Document Summarization of Medical Studies (2021)EMNLP 2021
-
Conference PaperSciA11y: Converting scientific papers to accessible HTML (2021)ASSETS 2021
-
Journal Article, Academic JournalSearching for scientific evidence in a pandemic: an overview of TREC-COVID (2021)Journal of Biomedical Informatics
-
Conference PaperWhat do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019 (2021)CHI 2021
-
Conference Workshop PaperCORD-19: the COVID-19 open research dataset (2020)NLP-COVID at ACL 2020
-
Conference PaperFact or fiction: verifying scientific claims (2020)EMNLP 2020
-
Conference PaperMedICaT: a dataset of medical images, captions, and textual references (2020)EMNLP Findings 2020
-
Journal Article, Academic JournalMitigating biases in CORD-19 for analyzing COVID-19 literature (2020)Frontiers in Research Metrics and Analytics
-
Journal Article, Academic JournalModelling kidney disease using ontology: insights from the Kidney Precision Medicine Project (2020)Nature Reviews Nephrology
-
Conference PaperOverview of the 2020 Epidemic Question Answering Track (2020)TAC 2020
-
Conference PaperS2ORC: the Semantic Scholar open research corpus (2020)ACL 2020
-
Conference PaperSUPP.AI: finding evidence for supplement-drug interactions (2020)ACL Demo 2020
-
Conference PaperTREC-COVID: Constructing a Pandemic Information Retrieval Test Collection (2020)SIGIR Forum
-
Journal Article, Academic JournalTREC-COVID: rationale and structure of an information retrieval shared task for COVID-19 (2020)Journal of the American Medical Informatics Association
-
Journal Article, Academic JournalText mining approaches for dealing with the rapidly expanding literature on COVID-19 (2020)Briefings in Bioinformatics
-
ThesisOntology-driven pathway data integration (2019)Department of Biomedical Informatics and Medical Education, University of Washington
-
Conference Extended AbstractExtracting evidence of supplement-drug interactions from literature (2019)ML4H at NeurIPS 2019
-
Journal Article, Academic JournalPredicting instances of Pathway Ontology classes for pathway integration (2019)Journal of Biomedical Semantics
-
Conference PaperConstruction of the literature graph in Semantic Scholar (2018)NAACL Industry 2018
-
Conference Workshop PaperOntology alignment in the biomedical domain using entity definitions and context (2018)BioNLP at ACL 2018
-
PreprintPhenotypeXpression: sub-classification of disease states using public gene expression data and literature (2018)
-
Journal Article, Academic JournalFluctuation analysis of peak expiratory flow and its associations with treatment failure in asthma (2017)American Journal of Respiratory and Critical Care Medicine
-
Conference PaperSimilarity metrics for determining overlap among biological pathways (2017)ICBO 2017
-
Conference PaperAn analysis of differences in biological pathway resources (2016)ICBO and BioCreative 2016
-
Conference PaperDevelopment of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination (2016)AMIA 2016
-
Conference PaperBiological model development as an opportunity to provide content auditing for the Foundational Model of Anatomy ontology (2015)AMIA 2015
-
Journal Article, Academic JournalElectrical impedance myography in Duchenne muscular dystrophy and health controls: a multi-center study of reliability and validity (2015)Muscle & Nerve
-
Masters ThesisMatching Pursuit for Detecting Epileptic Response in EEG Following Photic Stimulation (2013)Department of Biomedical Engineering, The Johns Hopkins University
-
Journal Article, Academic JournalAssessment of alterations in the electrical impedance of muscle after experimental nerve injury via finite-element analysis (2011)IEEE Transactions on Biomedical Engineering
-
Journal Article, Academic JournalElectrical impedance myography for monitoring motor neuron loss in the SOD1 G93A amyotrophic lateral sclerosis rat (2011)Clinical Neurophysiology
Presentations
-
AI and Scholarly Publishing
(2022)
Society for Scholarly Publishing ‘Ask the Experts’ Webinar - Online
-
Generating scientific claims for automated scientific fact checking
(2022)
ACL - Dublin, Ireland
-
How AI can make PDF useful again
(2022)
PageBreak - San Francisco, CA
-
Identifying and Mitigating Algorithmic Biases
(2022)
School of Law, Seattle University - Seattle, WA
-
Knowledge Representation and Semantics for Biomedical Knowledge Synthesis
(2022)
SeBiLAn Workshop at TheWebConf (WWW) - Online
-
Literature-Augmented Clinical Outcome Prediction
(2022)
NAACL - Seattle, WA, USA
-
MultiVerS: Improving scientific claim verification with weak supervision and full-document context
(2022)
NAACL - Seattle, WA, USA
-
Ontology and NLP: Bridging the ‘Structural Chasm
(2022)
Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
The Machine Element: Signals and Noise: How AI and ML Techniques are Being Deployed to Track a Global Pandemic
(2022)
Friends of the NLM Virtual Workshop - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Information School, University of Washington - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
School of Data Science, University of Virginia - Charlottesville, VA, USA
-
Unlocking Biomedical Knowledge: NLP Systems for Automating Systematic Literature Review
(2022)
Department of Informatics, Luddy School of Informatics, Indiana University-Bloomington - Online
-
Unlocking Biomedical Knowledge: NLP Systems for Synthesizing Biomedical Evidence
(2022)
Computer Science Research Seminar, Emory University - Online
-
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups
(2022)
ACL - Dublin, Ireland
-
A bibliometric analysis of citation diversity in accessibility and HCI research
(2021)
CHI - Online
-
Fast-track Learning: Growing Insights from Text-mining COVID-19 Data
(2021)
1st GTM2021 Virtual Forum - Online
-
Mathematics in the Scholarly Literature
(2021)
Conference on Artificial Intelligence and Theorem Proving (AITP) - Aussois, France and Online
-
MS^2: Multi-document summarization of medical studies
(2021)
EMNLP - Punta Cana, Dominican Republic
-
NLP and Text Mining Resources for COVID-19 and Beyond
(2021)
Machine Learning for Preventing and Combating Pandemics Workshop at ICLR 2021 - Online
-
Practical NLP for Biomedicine: Synthesizing Knowledge from Scientific Literature
(2021)
CS Colloquium, Northwestern University - Online
-
Practical NLP for scientific text mining: extracting and synthesizing knowledge from the literature
(2021)
Science of Science Summer School (S4) - Online
-
SciA11y: Converting scientific papers to accessible HTML
(2021)
ASSETS - Online
-
Text Mining Insights from the COVID-19 Pandemic
(2021)
Bibliometric-enhanced Information Retrieval (BIR) Workshop at ECIR 2021 - Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Relativity Media Pandemic short film discussion panel - Online
-
The Power of AI: A Discussion on COVID-19 & the Future of Industries
(2021)
Legalweek - Online
-
Using Machine Learning to Verify Scientific Claims
(2021)
OECD Workshop on AI and the Productivity of Science - Online
-
What do we mean by 'Accessibility Research'? A literature survey of accessibility papers in CHI and ASSETS from 1994 to 2019
(2021)
CHI - Online
-
Building Community and Data Ecosystem for Data Discovery and Reuse
(2020)
Artificial Intelligence for Data Discovery and Reuse (AIDR) Symposium - Online
-
CORD-19 Search: Using Machine Learning to Explore COVID-19 Scientific Literature
(2020)
AWS Education: Research Seminar Series - Online
-
CORD-19: the COVID-19 open research datase
(2020)
NLP-COVID Workshop at ACL - Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
NLP Meetup (NY-NLP, A2D-NLP, DC-NLP, Hungarian NLP, London Text Analytics) - Online
-
CORD-19: The COVID-19 Open Research Dataset
(2020)
Global Tech Mining Conference - Online
-
Fact or fiction: verifying scientific claims
(2020)
EMNLP - Virtual
-
Improving Access to Scientific Literature for NLP
(2020)
Microsoft Research Hanover Group - Online
-
MedICaT: a dataset of medical images, captions, and textual references
(2020)
SDP Workshop at EMNLP - Online
-
Mining the COVID-19 Scientific Literature with the CORD-19 Open Research Dataset.
(2020)
Artificial Intelligence for Data Discovery and Reuse (AIDR) Symposium - Online
-
Open Publishing and Open Data
(2020)
Neuro-Gairdner Open Science in Action Symposium - Online
-
Rapid Fire Session: Showcasing What is Here!
(2020)
Gastroenterology and Artificial Intelligence: 2nd Annual Artificial Intelligence Summit - Online
-
S2ORC: the Semantic Scholar open research corpus
(2020)
ACL - Online
-
SUPP.AI: finding evidence for supplement-drug interactions
(2020)
ACL Demo - Online
-
The COVID-19 Open Research Dataset
(2020)
Connected Health and COVID-19: Now and Beyond the Great Lockdown - Online
-
The COVID-19 Open Research Dataset
(2020)
Semantic Indexing and Information Retrieval for Health (SIIRH) Workshop at ECIR - Online
-
The COVID-19 Open Research Dataset
(2020)
Centre for Science and Technology Studies, Leiden University - Online
-
The Role of Scientific NLP During an Epidemic
(2020)
1st SciNLP Workshop on Natural Language Processing and Data Mining for Scientific Text - Online
-
TREC-COVID: information retrieval for supporting COVID-19 research
(2020)
AMIA Natural Language Processing Working Group Pre-Symposium - Online
-
Automated Identification of Noise Signal in Spinal DCE-MRI using Independent Component Analysis and Unsupervised Machine Learning
(2019)
ISMRM - Montréal, QC, Canada
-
Extracting evidence of supplement-drug interactions from literature
(2019)
ML4H Workshop at NeurIPS - Vancouver, BC, Canada
-
Ontology-based Integration of Biological Pathway Data
(2019)
Scientific Literature Knowledge Bases Workshop at Automated Knowledge Base Construction (AKBC) - Amherst, MA, USA
-
A Brief Introduction to Ontology
(2018)
Kidney Precision Medicine Project Ontology Webinar - Seattle, WA, USA
-
A SPARQL Tutorial
(2018)
Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
Learning from Biomedical Knowledge
(2018)
The Allen Institute for Artificial Intelligence (AI2) - Seattle, WA, USA
-
Ontologies and Algorithms for Integrating Biological Pathway Data
(2018)
BIME 590 Seminar, Department of Biomedical Informatics and Medical Education, University of Washington - Seattle, WA, USA
-
Ontology alignment in the biomedical domain using entity definitions and context
(2018)
Bio-NLP Workshop at ACL - Melbourne, Australia
-
Ontology- based integration of pathway databases using Pathway Ontology annotations
(2018)
Bio-Ontologies at ISMB - Chicago, IL, USA
-
Quantifying the effects of gene entity disambiguation for GSEA
(2018)
AMIA Symposium - San Francisco, CA, USA
-
Semi-automated integration of pathway data for pathway analysis
(2018)
Knowledge Representation and Semantics Working Group Pre-Symposium Doctoral Consortium, AMIA Symposium - San Francisco, CA, USA
-
Detection and Functional Classification of Fusion Genes Using Pathway Expression Profiles
(2017)
AMIA Joint Summits on Translational Science - San Francisco, CA, USA
-
Similarity metrics for determining overlap among biological pathways
(2017)
ICBO - Newcastle upon Tyne, United Kingdom
-
An analysis of differences in biological pathway resources
(2016)
ICBO & BioCreative - Corvallis, OR, USA
-
Auditing tree-like organ systems in the FMA using network motifs
(2016)
AMIA Symposium - Chicago, IL, USA
-
Development of a novel Markov chain model for the prediction of head and neck squamous cell carcinoma dissemination
(2016)
AMIA 2016 - Chicago, IL, USA
-
Discovering representational differences between pathway knowledge bases for pathway resource merging
(2016)
AMIA Symposium - Chicago, IL, USA
-
Identifying and resolving inconsistencies in biological pathway resources
(2016)
NLM Informatics Training Conference - Columbus, OH, USA
-
Biological model development as an opportunity to provide content auditing for the foundational model of anatomy ontology
(2015)
AMIA Symposium - San Francisco, CA, USA
-
Development of a discharge ontology to support postanesthesia discharge decision making
(2015)
ICBO - Lisbon, Portugal
-
Ontological content auditing during model creation using the foundational model of anatomy
(2015)
NLM Informatics Training Conference - Bethesda, MD, USA
-
Detrended fluctuation analysis of peak expiratory flow and its association with destabilization of asthma control
(2014)
International Conference of the American Thoracic Society (ATS) - San Diego, CA, USA
-
Electrical impedance myography in DMD: a multi-center study of reliability and relationships to strength and function
(2013)
The 18th International Congress of the World Muscle Society - Asilomar, CA, USA
-
6 Years of FEVER Workshops - How Far Have We Come?
The Sixth Workshop on Fact Extraction and Verification (FEVER) at EACL - Dubrovnik, Croatia
-
AI in Scholarly Communications: Where We Are and Where We’re Going
FORCE11 Scholarly Communication Institute - Online
-
Biomedical Evidence Extraction and Synthesis
The Center for Informatics Research in Science and Scholarship (CIRSS) Seminar - Urbana-Champaign, IL
-
Can Scientific Claim Verification Help Us Do Better Science?
The Sixth Workshop on Fact Extraction and Verification (FEVER) at EACL - Dubrovnik, Croatia
-
Incorporating External Knowledge for Clinical Outcome Prediction
Institute for Medical Data Science Seminar - Seattle, WA