Our group works at the intersection of healthcare, big data and artificial intelligence


To help further research and improve precision care, the Veterans Affairs (VA) and Department of Energy (DOE) have partnered to bring together VA’s healthcare and genomic data with DOE’s high-performance computing (HPC) resources and expertise. The vast VA data includes Electronic health records (EHR) for almost 24M patients covering 22 years. EHR contains a combination of various data types including, genomics, structured data such as demographics and diagnoses, and unstructured data such as physicians’ and nurses’ notes. So far, the structured data has been mostly used for predicting outcomes and this is due in part to the complexity of dealing with unstructured data which contain ~4.65 trillion documents in the VA dataset. However, studies have shown that structured data alone have low performance metrics. We are developing AI models that integrate structured, unstructured, and geospatial data to improve VA’s ability to identify patients at risk of suicide, overdose, or complications from obstructive sleep apnea and to assess response to lung cancer treatment.