Peter L. Elkin, MD, is among the Jacobs School of Medicine and Biomedical Sciences researchers involved in COVID-19 data collection.

Jacobs School Researchers Collecting COVID-19 Data

Published June 26, 2020

Researchers in the Jacobs School of Medicine and Biomedical Sciences continue to spearhead a number of projects related to the COVID-19 global health pandemic.

Print

Peter L. Elkin, MD, professor and chair of biomedical informatics, says several current studies are focused on data collection that can be used to better understand how to combat COVID-19.

Much of the work is being completed through the Clinical and Translational Science Awards (CTSA) consortium, of which the University at Buffalo is a member.

UB is one of more than 50 medical research institutions across the nation currently receiving CTSA program funding from the National Institutes of Health.

Building a Data Warehouse for CTSA Consortium

One such project is the launch of the National COVID Cohort Collaborative (N3C), a joint program between the National Center for Data to Health and the National Center for Advancing Translational Sciences.

Elkin says the project’s aim is to build a warehouse of COVID-19 data for the entire CTSA consortium and for other interested contributing health care organizations.

“This is intended to hold all patient data (inpatient and outpatient) on COVID-tested patients from all of the CTSA hubs,” he says. “It entails a cloud-based method for data collection on the COVID-19 pandemic.”

“We are working closely with N3C to see how this can be designed and implemented in a standardized and timely fashion,” Elkin says.

“The goal of developing a national-level COVID-19 database is to facilitate research and improve recruitment to clinical trials,” he says.

N3C is looking to address the many difficult questions raised by the COVID-19 global emergency, such as:

  • who is infectious
  • who may need hospital care and at what level
  • what are the key risk factors
  • what are the best prognostic indicators
  • what are best practices for ethical resource allocation
  • which drugs are the most viable candidates for patients

Collecting Data on Upper Respiratory Infections

UB is also a member of a New York State initiative, COMBATCOVID, to save case report forms on all hospital admissions for upper respiratory infections, including all patients tested for COVID-19 or patients who are suspected to have COVID-19.

The statewide consortium will collect and analyze the results from all the CTSA institutions in the state.

“It is being run out of New York University, and I am participating from our site as our CTSA informatics core director,” Elkin says. “I am working on the design and data governance.”

“The data use agreements are being signed, and the database design and data definitions are being built,” he adds. “This larger row level dataset will allow us to ask questions that would not be possible at any one institution.”

In the Department of Biomedical Informatics, Elkin and Frank D. LeHouillier, senior programmer and analyst, are involved in the project.

Clinical researchers in the Jacobs School who are involved include:

Stool Microbiome Samples Yield Information

Researchers in the Department of Biomedical Informatics have also developed a validated microbiome platform that finds infected persons with COVID-19 — whether symptomatic or not — using deep sequencing of stool microbiome samples.

Elkin is working with Sapan Mandloi, PhD, a postdoctoral associate in biomedical informatics, in using a National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) database to collect and process metagenomics data for the organism classified as “human gut metagenome.”

The total number of samples are more than 300,000 divided into 3,464 projects, according to Mandloi.

“We are performing comparison of all samples’ raw sequences with SARS-Cov-2 genome using a NCBI SRA Taxonomy Analysis Tool (STAT), which utilizes precomputed k-mer dictionary databases and gene specific profiling,” Mandloi says. “This allows us to perform geographic mapping of samples identified across the world,” Mandloi says.

9,720 samples were identified as potential cases of colonization for COVID-19, which were mostly from the U.S., China, Australia and the U.K., Mandloi adds.

“The ability to identify and track this trafficking of genetic material is vital as a public health topic,” he says. “As of now, this large pool of genetic data remains largely untapped for clinical surveillance using the combined strategy of gene-based profiling and k-mer based classification on raw genomic data.”