The Wicklow AI in Medicine Research Initiative
 
WAMRI-Logo.png
 
WAMRI_Background.png
 

About

The AI & Medicine Initiative (AIM), formerly the Wicklow AI in Medicine Research Initiative (WAMRI), provides researchers the opportunity to focus on utilizing artificial intelligence models to advance critical medical research across areas including oncology, cardiology and neurology. AIM, located at USF's Data Institute in downtown San Francisco, originally launched with five joint projects.

The researchers and faculty at USF’s Data Institute have a high impact record of conducting multidisciplinary research in collaboration with leading medical research institutions such as UCSF, Harvard Medical School and Boston Children’s Hospital. WAMRI was funded through a gift from Wicklow Capital to support research at the intersection of machine learning, AI and medicine.

To find out more about the projects and research going on at WAMRI, check out the past projects here.

 
 
 

Working with WAMRI

We would love to help you analyze your organization's medical data, whether it be imaging, text, tabular, genomic, or any other type (or a mixture of types) by developing a collaborative project. We can provide the data science expertise from our faculty and mentors; we would need you to provide the medical data and medical expertise, and to work closely with our data scientists. Projects are generally academic in nature - that is, the end product will be an academic paper or similar.

contact@wamri.ai

 
 
 

AFFILIATES

 
 
 
jh-head.jpg

Jeremy Howard

WAMRI Chairman, Founder Fast.ai

Jeremy Howard is an entrepreneur, business strategist, developer, and educator. Jeremy is a founding researcher at fast.ai, a research institute dedicated to making deep learning more accessible. He is also a faculty member at the University of San Francisco, and CSO at doc.ai and platform.ai.

Jeremy’s most recent startup, Enlitic, was the first company to apply deep learning to medicine. He was previously the President and Chief Scientist of the data science platform Kaggle, where he was the top ranked participant in international machine learning competitions 2 years running. He was the founding CEO of two successful Australian startups (FastMail, and Optimal Decisions Group–purchased by Lexis-Nexis). Before that, he spent 8 years in management consulting, at McKinsey & Co, and AT Kearney.

He has many television and other video appearances, including as a regular guest on Australia’s highest-rated breakfast news program, a popular talk on TED.com, and data science and web development tutorials and discussions.

du_head.jpg

David Uminsky

WAMRI Board Member, USF Director MS in Data Science & Data Institute

David Uminsky is the director for the MS in Data Science program. His research interests are in applied mathematics. He is interested in unsupervised machine learning, data clustering, algebraic signal processing, as well as pattern formation, dynamical systems and fluids. David was selected in 2015 by the National Academy of Sciences (NAS) as a Kavli Frontiers of Science Fellow. Each year, 100 researchers under the age of 45 are selected by the academy, and the 20% of the current NAS were previous Kavli Fellows. He is also the founding director of the Bachelor's program in Data Science at University of San Francisco. Before joining USF, he was a combined National Science Foundation and UC President's Fellow at UCLA, where he was awarded the Chancellor's Award for post-doctoral research. This award is given to approximately top 20 post-docs out of over a thousand who qualify for consideration. He holds a PhD in Mathematics from Boston University and a BS in Mathematics from Harvey Mudd College.

fm-head.jpg

Fred Monroe

RESEARCH AND SUPPORT

Fred Monroe is a machine learning researcher, serial entrepreneur, software developer, trader and investor with over twenty five years of experience working at the forefront of technology.  He is working to apply advances in deep learning and artificial intelligence (AI)  to important real world problems like those studied at WAMRI.

Fred also works for Wicklow Capital, a family office which makes investments across a wide range of early stage companies. Fred helps Wicklow assess investments and donations, particularly when related to artificial intelligence.

 
 

Projects and Research

 
 

2019 - 2020 COHORT

 

*Currently accepting proposals for the upcoming 2020-2021 cohort through October 23rd. Contact us at contact@wamri.ai.

 

Early Prediction of Hospital Acquired Sepsis

Zachary Barnes, MS Data Science

Collaboration with Dr. Leo Liu @ UcSF

In a collaboration with the UCSF School of Medicine, our research is on the development of a model for the early prediction of hospital acquired sepsis. Several features differentiate this research from current sepsis models in research and production: training a model using the UCSF information commons database, extensive feature engineering, labeling patients based on the most reliable definition of sepsis, and focusing on a non-ICU ward. We achieved state of the art results with an AUCPR of 0.67 four hours before onset using the leading model, a multi-task gaussian process and recurrent neural network, and comparable results with tree-based models trained with expert engineered features.

 

De-Crappifying Microscopy Images of the Brain

annette lin and alaa a latif, MS Data Science

Collaboration with Dr. Uri Manor and Linjing FanG @ Salk

Electron Microscopy (EM) is an important tool for studying biological systems at their most fundamental level. Our research work involves training deep neural networks to de-crapify (de-noising and super-resolution) EM images of brain structures using self-supervised learning and perceptual losses.

 
 

Multivariate Pattern Analysis with Machine Learning

Maxwell calehuff, MS Data Science

Collaboration with Dr. Ross Filice @ Medstar georgetown university hospital

This project applies cutting edge natural language processing techniques to improve feedback for medical students. Students' written imaging reports are automatically flagged when they deviate from those of the supervising physicians. Portions of the flagged reports that deviate in meaning, not just phrasing, are highlighted to improve understanding.

 

Fast Acquisition Microscopy Via Deep Learning 

Dr. Tahir Issa Bachar, USFDI POSTDOC

Collaboration with Dr. Claudio Vinegoni @ Harvard

Laser scanning microscopy is a powerful imaging modality ideal for monitoring spatial and temporal dynamics in both in vitro and in vivo models with particular emphasis in the neuroimaging field. Unfortunately, the video-rate acquisition required to capture these changes lead to low resolution images, with high spatial distortion and low signal-to-noise ratio. By combining microscopy fast acquisition methods with deep learning, we show here that image acquisition up to 20x the speed of equivalent standard high resolution condition can be obtained without sacrificing image quality. Specifically, we present here a GAN based training approach able to simultaneously 1) super-resolve, 2) denoise and 3) correct distortion on fast scanning acquisition microscopy images.

'Fast Acquisition Microscopy via Deep Learning' Issa Tahir Bachar, Postdoctoral Fellow, USF Data Institute in collaboration with In Vivo Microscopy @ Harvard...

 

Contrast Enhancement Prediction in Brain MRI

wendeng hu and todd zhang, MS Data Science

collaboration with Dr. Greg Zaharchuk @ stanford

Gadolinium-based contrast agents (GBCAs) have become an integral part of daily clinical brain abnormality diagnosis. However, the consensus is that the use of GBCAs could have harmful effects on the human body and should only be used in limited amounts when no alternatives are available. The team developed a deep learning model to predict contrast enhancement from non-contrasted multiparametric brain magnetic resonance imaging (MRI) to replace the use of GBCAs potentially. The predictions highly resembled the actual contrast-enhanced images based on PSNR and SSIM metrics.

Simple CNN Models for Classification on Medical Images

Roja immanni, MS Data Science

Collaboration with Dr. Gilmer Valdes, UCSF

Medical image datasets are fundamentally different from natural image datasets in terms of the number of available training observations and the number of classes for the classification task. We hypothesized that compared to architectures used for natural images, those needed for medical imaging can be simpler. Here, we propose smaller architectures and show how they perform similarly while significantly saving training time and memory. 

 

Multiple-Instance Learning on Pathology Slides

Lori sheng, MS Data Science

Collaboration with Dr. Dmytro Lituiev @ UCSF

When it comes to applying computer vision in the medical field, most tasks involve either 1) image classification for diagnosis or 2) segmentation to identify and separate lesions. However, in pathology cancer detection, this is not always possible. Obtaining segmentation labels is time-consuming and labor-intensive. Furthermore, a single pathology slide can be up to 200k x 100k pixels resolution. We investigate Multiple Instance Learning as a deep learning method to handle both memory and labelling constraints.

 

Disease Diagnosis from Brain MEG with LSTMs

Lexie Sun, MS Data Science

Collaboration with Dr. Ashish Raj and Xihe Xie @ Brain Networks Lab - UCSF

When it comes to applying computer vision in the medical field, most tasks involve either 1) image classification for diagnosis or 2) segmentation to identify and separate lesions. However, in pathology cancer detection, this is not always possible. Obtaining segmentation labels is time-consuming and labor-intensive. Furthermore, a single pathology slide can be up to 200k x 100k pixels resolution. We investigate Multiple Instance Learning as a deep learning method to handle both memory and labelling constraints.

 

Auto-Segmentation of Brain Metastases

sihan Chen, MS Data Science

Collabration with Dr. tomi Nano, Gilmer Valdes and olivier morin @ UCSF

Brain metastases (BM) have a high incidence and affect up to 1/3 of patients with cancer. To develop an accurate auto-segmentation for BM, it’s essential to identify the limitations of current models. Empowered by the world’s largest BM dataset, We built a deep-learning-based auto-segmentation and evaluate it on test datasets with different properties. We found that the volume of BM and the number of BM in a brain affects the accuracy drastically.

 

Auto-Segmentation of Brain Metastases

Akanksha, MS Data Science

Collabration with Dr. Ashish Raj and Xihe Xie @ Brain Networks Lab - UCSF

The task is to build a deep learning model to perform 3D semantic image segmentation of brain sMRI into White Matter, Grey Matter, and Cerebrospinal fluid, and the employing transfer learning to extend the segmentation model for the regression task of predicting cognitive scores of the patients. Presently I have built a 3D symmetrical U-Net model using PyTorch which gives an overall dice coefficient of 92% across classes on the ADNI dataset, and right now I am in the process of extending it to the regression task.

 
 

2018 - 2019 COHORT

 
 

Functional Embeddings Identify Differences in Schizophrenia Sub-types

Lead: Dr. James Wilson, USF

Speaker: Dr. Abbie Popa, USFDI PostDOC

The team helped create a new technique to quantify how different areas of the brain are connected and then utilized this technique to examine differences between schizophrenics and healthy individuals. Using a mathematical representation of how correlated two parts of the brain are across individuals in a group, the team was able to show correlations among brain regions exactly where they were expected. With the model validated she then compared healthy individuals to those with schizophrenia. The results were able to show a difference in regions of the brain that are well documented in neuroscience literature as areas the disease effects. Future work includes being able to identify sub-types of schizophrenia using this technique, as well as collecting more data to increase the effectiveness of the technique.

 

Automated Segmentation of Brain Ventricles using 3D U-Nets

Lead: Dr. Olivier Morin, UCSF

Speakers: Anish Dalal and Robert Sandor, MS Data Science

The team developed a deep learning model that performs as well or better than the leading proprietary software used in hospitals to identify Hydrocephalus (cerebrospinal fluid leaking into the ventricles of the brain). Monitoring of the ventricles in the brain for swelling is very important for newborns, patients after brain surgery, and for patients who have suffered a stroke or other traumatic brain injury. If fluid builds up in these ventricles it can lead to symptoms including headache, loss of vision, a decline in memory, and impaired neural development . Monitoring the ventricles is an expensive and time-consuming task for doctors because they have to go through many 2D images of the brain, drawing boundaries around the ventricles. This model makes this process faster and more accurate than the current software solution, which the team hopes will help improve monitoring of patients and bring down the cost of care for hospitals.

Enjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.

 

Deep Learning to Assist Medical Imaging Diagnostics 

Lead: Dr. Vasant Kearney, UCSF

Speakers: Tianqi Wang And Alan Perry, MS Data Science

The team studied the applications of deep learning on CT scans, while also producing two academic papers on their findings. The first paper showed that deep learning can categorize pixels of CT scans based on if they are a part of the different organs in the body. This technique could be helpful for clinicians interested in studying any disease that manifests in organ tissue. The second paper showed that MRI data could be converted to CT scan data. Using deep learning techniques the the team showed that a model could be used to convert data between the two types of scans. While this has applications for clinicians who may want one type of scan to compare two patients without having to run another test, this technique also has the possibility of affecting data science by allowing practitioners to increase the amount of data they have for a specific task.


 

Improving the Efficiency in View Classification of Echo-Cardiograms with Deep Learning 

Lead: Dr. Rima Arnaout, UCSF

Speakers: Divya Shargavi and Max Alfaro, MS Data Science

Echo-cardiograms are an important imaging technique used by doctors to inform their diagnostics of a patient's heart. However these images contain a lot of variation since echo-cardiograms can be still-images or videos taken from many different angles on moving patient. Doctors have standardized 15 “views” of these images, or angles the ultrasound is taken from. The team worked  to train a model to recognize which of the 15 different “views” the image was taken from. This is important to understand the diagnostic potential of each image, and could also be used to help doctors standardize the variation within the different views. By doing this they hope to ultimately build a tool to assist doctors in diagnostics.

 

Deep Learning with Medical Images: Towards Best Practices for Small Data

Lead: Dr. Gilmer Valdes, UCSF

Speaker: Miguel Romero, MS Data Science

Large data-sets are getting all the attention from big companies these days, but most medical researchers and practitioners around the world are working with small amounts of data. This poses a challenge to building accurate and reproducible machine learning models. The team took two approaches to address this issue. The first approach was to rigorously asses the efficacy of different methods to train machine learning models. The next approach was to understand how well models transfer knowledge from task to task, based on both the size and contents of the training set.

 

Multivariate Pattern Analysis with Machine Learning

Lead: Dr. Peter Wais, UCSF

Speaker: Jenny Kong, MS Data Science

Using complex and noisy data from fMRI scans, the team conducted research to correlate the activation of different regions of the brain with High Fidelity Memory (HFM) storage. HFM is a type of active memory, where individuals detect and remember minute details in images. Researchers hope that increasing our knowledge of how this type of memory forms will aid in our understanding of attention’s impact on memory. Using clustering and statistical analyses, the team sought to identify when individuals were using HFM by analyzing the haemodynamic response function from fMRI scans taken while participants took a visual memory test.