About IMDS

Harnessing data science and innovative methods to transform complex health data into actionable insights for improving healthcare.

Evidence-based knowledge comes from good, rich data combined with rigorous analysis. Unlocking the potential of health data analytics requires both the careful application of existing tools and the development of new techniques to address the complexities of modern health systems data. 

The Innovative Methods and Data Science (IMDS) program bridges the learning health systems pipeline from data acquisition to clinical insights. Our focus includes developing innovative data science tools and statistical methods for healthcare that remain effective, equitable, and impactful while improving quality during deployment.

IMDS at a Glance

IMDS launched in November 2022 as a program within CLHSS, in partnership with the School of Public Health Division of Biostatistics, and is now establishing its operations and services. The initial goals of IMDS are to:

  • Develop innovative computational methods in health research for accountable, interpretable, impactful, and fair analysis of health system data;
  • Partner with CLHSS researchers to augment AI/ML tools with advanced statistical analysis in healthcare;
  • Provide expert consultation on modern study designs and health data analytics methods for CLHSS-affiliated faculty, staff, and students.

Join Us

If you’re interested in engaging with IMDS or have inquiries, please email [email protected] to learn more about how we can support your research goals.

Publications

2024

Multi-modality risk prediction of cardiovascular diseases for breast cancer cohort in the All of Us Research Program
Yang H, Zhou S, Rao Z, Zhao C, Cui E, Shenoy C, Blaes AH, Paidimukkala N, Wang J, Hou J, Zhang R

Reinforced Borrowing Framework: Leveraging Auxiliary Data for Individualized Inference
Ji Z, Wolfson J

Prevalent Metformin Use in Adults With Diabetes and the Incidence of Long COVID: An EHR-Based Cohort Study From the RECOVER Program
Johnson SG, Abedian S, Stürmer T, Huling JD, Lewis V C, Buse JB, Brosnahan SB, Mudumbi PC, Erlandson KM, McComsey GA, Arnold J, Wiggen TD, Wong R, Murphy S, Rosen C, Kaushal R, Weiner MG, Bramante C; RECOVER PCORnet EHR Cohort and the N3C Consortium

A Unified Framework for Causal Estimand Selection
Barnard M, Huling JD, Wolfson J

A review of reinforcement learning for natural language processing and applications in healthcare
Liu Y, Wang H, Zhou H, Li M, Hou Y, Zhou S, Wang F, Hoetzlein R, Zhang R

Effect of SARS-CoV-2 Infection on Incident Diabetes by Viral Variant: Findings From the National COVID Cohort Collaborative (N3C)
Wong R, Hall MA, Wiggen T, Johnson SG, Huling JD, Turner LE, Wilkins KJ, Yeh HC, Stürmer T, Bramante CT, Buse JB, Reusch J; N3C Consortium

FuseLinker: Leveraging LLM’s pre-trained text embeddings and domain knowledge to enhance GNN-based link prediction on biomedical knowledge graphs
Xiao Y, Zhang S, Zhou H, Li M, Yang H, Zhang R

Causally interpretable meta-analysis combining aggregate and individual participant data
Rott KW, Clark JM, Murad MH, Hodges JS, Huling J

Comparing Insulin Against Glucagon-Like Peptide-1 Receptor Agonists, Dipeptidyl Peptidase-4 Inhibitors, and Sodium-Glucose Cotransporter 2 Inhibitors on 5-Year Incident Heart Failure Risk for Patients With Type 2 Diabetes Mellitus: Real-World Evidence Study Using Insurance Claims
Wang X, Plantinga AM, Xiong X, Cromer SJ, Bonzel CL, Panickan V, Duan R, Hou J, Cai T

A proposed method for identifying Interfacility transfers in Medicare claims data
Nikpay S, Leeberg M, Kozhimannil K, Ward M, Wolfson J, Graves J, Virnig BA

A Case Demonstration of the Open Health Natural Language Processing Toolkit From the National COVID-19 Cohort Collaborative and the Researching COVID to Enhance Recovery Programs for a Natural Language Processing System for COVID-19 or Postacute Sequelae of SARS CoV-2 Infection: Algorithm Development and Validation
Wen A, Wang L, He H, Fu S, Liu S, Hanauer DA, Harris DR, Kavuluru R, Zhang R, Natarajan K, Pavinkurve NP, Hajagos J, Rajupet S, Lingam V, Saltz M, Elowsky C, Moffitt RA, Koraishy FM, Palchuk MB, Donovan J, Lingrey L, Stone-DerHagopian G, Miller RT, Williams AE, Leese PJ, Kovach PI, Pfaff ER, Zemmel M, Pates RD, Guthe N, Haendel MA, Chute CG, Liu H; National COVID Cohort Collaborative; RECOVER Initiative

RT: a Retrieving and Chain-of-Thought framework for few-shot medical named entity recognition
Li M, Zhou H, Yang H, Zhang R

LEAP: LLM instruction-example adaptive prompting framework for biomedical relation extraction
Zhou H, Li M, Xiao Y, Yang H, Zhang R

Favorable Antiviral Effect of Metformin on SARS-CoV-2 Viral Load in a Randomized, Placebo-Controlled Clinical Trial of COVID-19
Bramante CT, Beckman KB, Mehta T, Karger AB, Odde DJ, Tignanelli CJ, Buse JB, Johnson DM, Watson RHB, Daniel JJ, Liebovitz DM, Nicklas JM, Cohen K, Puskarich MA, Belani HK, Siegel LK, Klatt NR, Anderson B, Hartman KM, Rao V, Hagen AA, Patel B, Fenno SL, Avula N, Reddy NV, Erickson SM, Fricton RD, Lee S, Griffiths G, Pullen MF, Thompson JL, Sherwood NE, Murray TA, Rose MR, Boulware DR, Huling JD; COVID-OUT Study Team

Navigating Online Health Information: Assessing the Quality and Readability of Dietary and Herbal Supplements for Chronic Musculoskeletal Pain
Austin RR, Jantraporn R, Schulz C, Zhang R

Frustrations with time spent on cancer care among patients with metastatic breast or advanced ovarian cancer
Vogel RI, Jewett P, Parsons HM, Brown K, Pecoraro A, Starks I, Adewakun Ingram S, Gupta A, Gek Koon Teoh D, Fan Y, Blaes AH, Arend RC, Rocque GB, Wolfson J

Prediction of the mechanism of suicide among Minnesota residents using data from the Minnesota violent death reporting system (MNVDRS) 
Waller DC, Wolfson J, Gingerich S, Wright N, Ramirez MR

Semi-supervised Double Deep Learning Temporal Risk Prediction (SeDDLeR) with Electronic Health Records
Nogues IE, Wen J, Zhao Y, Bonzel CL, Castro VM, Lin Y, Xu S, Hou J, Cai T

Develop and validate a computable phenotype for the identification of Alzheimer's disease patients using electronic health record data
He X, Wei R, Huang Y, Chen Z, Lyu T, Bost S, Tong J, Li L, Zhou Y, Li Z, Guo J, Tang H, Wang F, DeKosky S, Xu H, Chen Y, Zhang R, Xu J, Guo Y, Wu Y, Bian J

A taxonomy for advancing systematic error analysis in multi-site electronic health record-based clinical concept extraction
Fu S, Wang L, He H, Wen A, Zong N, Kumari A, Liu F, Zhou S, Zhang R, Li C, Wang Y, St Sauver J, Liu H, Sohn S

Cognitive function and bladder health among midlife adult women in the Coronary Artery Risk Development in Young Adults (CARDIA) study
Brady SS, Arguedas A, Huling JD, Hellemann G, Yaffe K, Lewis CE, Fok CS, Van Den Eeden SK, Markland AD

Comparison of Weighting Methods to Understand Improved Outcomes Attributable to Public Health Nursing Interventions
Huling JD, Austin RR, Lu SC, Mathiason MA, Pirsch AM, Monsen KA

Unraveling the multiple chronic conditions patterns among people with Alzheimer's disease and related dementia: A machine learning approach to incorporate synergistic interactions
Yew PY, Devera R, Liang Y, Khalifa RAE, Sun J, Chi NC, Chou YC, Tonellato PJ, Chi CL

Exploring Large Language Models for Acronym, Symbol Sense Disambiguation, and Semantic Similarity and Relatedness Assessment
Liu Y, Melton GB, Zhang R

Risk of Post-Acute Sequelae of SARS-CoV-2 Infection (PASC) Among Patients with Type 2 Diabetes Mellitus on Anti-Hyperglycemic Medications
Olawore O, Turner LE, Evans MD, Johnson SG, Huling JD, Bramante CT, Buse JB, Stürmer T; N3C Consortium

Consensus modeling: Safer transfer learning for small health systems
Tourani R, Murphree DH, Sheka A, Melton GB, Kor DJ, Simon GJ

Enhancing the coverage of SemRep using a relation classification approach
Ming S, Zhang R, Kilicoglu H

An in-depth evaluation of federated learning on biomedical natural language processing for information extraction
Peng L, Luo G, Zhou S, Chen J, Xu Z, Sun J, Zhang R

Potential impact of blood cholesterol guidelines on statin treatment in the U.S. population using interrupted time series analysis
Yew PY, Loth M, Adam TJ, Wolfson J, Liang Y, Tonellato PJ, Chi CL

Federated Learning with Convex Global and Local Constraints
He C, Peng L, Sun J

Repurposing non-pharmacological interventions for Alzheimer's disease through link prediction on biomedical literature 
Xiao Y, Hou Y, Zhou H, Diallo G, Fiszman M, Wolfson J, Zhou L, Kilicoglu H, Chen Y, Su C, Xu H, Mantyh WG, Zhang R

Logistic burdens of cancer care: A qualitative study
Dona AC, Jewett PI, Hwee S, Brown K, Solomon M, Gupta A, Teoh D, Yang G, Wolfson J, Fan Y, Blaes AH, Vogel RI

Power Analysis for Causal Discovery
Kummerfeld E, Williams L, Ma S

Machine Learning Identifies Higher Survival Profile In Extracorporeal Cardiopulmonary Resuscitation
Crespo-Diaz R, Wolfson J, Yannopoulos D, Bartos JA

Characterizing the spectrum of bladder health and lower urinary tract symptoms among men: Results from the CARDIA study
Markland AD, Hellemann G, Shan L, Brady SS, Huling JD, Schreiner PJ, Sidney S, Van Den Eeden SK, Lewis CE

Development and Validation of the Pharmacological Statin-Associated Muscle Symptoms Risk Stratification Score Using Electronic Health Record Data
Sun B, Yew PY, Chi CL, Song M, Loth M, Liang Y, Zhang R, Straka RJ

Unsupervised Machine Learning of the Combined Danish and Norwegian Knee Ligament Registers: Identification of 5 Distinct Patient Groups With Differing ACL Revision Rates
Martin RK, Wastvedt S, Pareek A, Persson A, Visnes H, Fenstad AM, Moatshe G, Wolfson J, Lind M, Engebretsen L

A Cluster-Randomized Evaluation of the SuperShelf Intervention in Choice-Based Food Pantries
Caspi CE, Gombi-Vaca MF, Barsness CB, Gordon N, Canterbury M, Peterson HH, Wolfson J, Pratt R

Kidney Outcomes with Sodium-Glucose Cotransporter-2 Inhibitor Initiation after AKI among Veterans with Diabetic Kidney Disease
Murphy DP, Wolfson J, Reule S, Johansen KL, Ishani A, Drawz PE

A Symptom-Based Natural Language Processing Surveillance Pipeline for Post-COVID-19 Patients
Silverman GM, Rajamani G, Ingraham NE, Glover JK, Sahoo HS, Usher M, Zhang R, Ikramuddin F, Melnik TE, Melton GB, Tignanelli CJ

Measuring Associations Between Community-Level Social Determinants of Health and Bariatric Surgery Weight Loss Outcomes
Skoufis N, Zhang R, Chen Y

External validation of the Norwegian anterior cruciate ligament reconstruction revision prediction model using patients from the STABILITY 1 Trial
Martin RK, Marmura H, Wastvedt S, Pareek A, Persson A, Moatshe G, Bryant D, Wolfson J, Engebretsen L, Getgood A

Discrimination and bladder health among women in the CARDIA cohort study: Life course and intersectionality perspectives
Brady SS, Arguedas A, Huling JD, Hellemann G, Lewis CE, Fok CS, Van Den Eeden SK, Markland AD

Robust sample weighting to facilitate individualized treatment rule learning for a target population
Chen R, Huling JD, Chen G, Yu M

Energy balancing of covariate distributions
Huling JD, Mak S

2023

An Open Natural Language Processing (NLP) Framework for EHR-based Clinical Research: A Case Demonstration Using the National COVID Cohort Collaborative (N3C) 
Liu S, Wen A, Wang L, He H, Fu S, Miller R, Williams A, Harris D, Kavuluru R, Liu M, Abu-el-Rub N, Schutte D, Zhang R, Rouhizadeh M, Osborne JD, He Y, Topaloglu U, Hong SS, Saltz JH, Schaffter T, Pfaff E, Chute CG, Duong T, Haendel MA, Fuentes R, Szolovits P, Xu H, Liu H

Extracting Complementary and Integrative Health Approaches in Electronic Health Records 
Zhou H, Silverman G, Niu Z, Silverman J, Evans R, Austin R, Zhang R

Generate Analysis-Ready Data for Real-world Evidence: Tutorial for Harnessing Electronic Health Records With Advanced Informatic Technologies 
Hou J, Zhao R, Gronsbell J, Lin Y, Bonzel CL, Zeng Q, Zhang S, Beaulieu-Jones BK, Weber GM, Jemielita T, Wan SS, Hong C, Cai T, Wen J, Panickan VA, Liaw KL, Liao K, Cai T

Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals 
Peng L, Luo G, Walker A, Zaiman Z, Jones EK, Gupta H, Kersten K, Burns JL, Harle CA, Magoc T, Shickel B, Steenburg SD, Loftus T, Melton GB, Gichoya JW, Sun J, Tignanelli CJ

The Association Between Inflammation, Incident Heart Failure, and Heart Failure Subtypes in Patients with Rheumatoid Arthritis 
Huang S, Cai T, Weber BN, He Z, Dahal KP, Hong C, Hou J, Seyok T, Cagan A, DiCarli MF, Joseph J, Kim SC, Solomon DH, Cai T, Liao KP

Related News