UoC researchers play central role in new genomic database for pediatric cancer

22 August 2017
Two crucial components of the Kids First project are the teams led by Robert L. Grossman, PhD, and Sam Volchenboum, MD, PhD, at the University of Chicago. Grossman and Volchenboum will play a central role in the technical underpinnings of the large-scale processing and sharing of genomic and clinical data for this important initiative, states a UoC press release.

Grossman, (Frederick H. Rawson Professor in Medicine and Computer Science and director of the Center for Data Intensive Science at the University of Chicago), heads up an operations center that runs numerous data commons, supporting more than 20,000 researchers across the world every month. “Platforms that enable researchers to analyze securely large amounts of de-identified clinical and genomic data are one of our most powerful tools for making discoveries that improve children’s lives,” Grossman is quoted.

His team is known for its work on the NCI’s Genomic Data Commons (GDC), a federally funded, unified data system that promotes sharing of cancer genomic and clinical data between researchers.
The GDC is a core component of the National Institutes of Health’s Precision Medicine Initiative.

Combining expertise for Kids First project

For the Kids First project, Grossman will work closely with Volchenboum, an expert in pediatric cancers and director of the Center for Research Informatics at UChicago. Volchenboum’s team developed the world’s first international pediatric cancer data commons, housing data on more than 19,000 neuroblastoma patients from around the world.

Under Grossman and  Volchenboum , the Chicago team of engineers and scientists will design and operate the cloud-based, open-source software needed to establish the data coordination center within the Kids First data resource center.

“This is a critical step forward for the pediatric oncology community,” Volchenboum believes. “The Kids First data resource center will provide a much-needed resource for pediatric researchers to leverage a large set of genomic and clinical data on children. These data will help us understand why some children develop cancer and how to best stratify and treat their disease.”

Comprehensive portal for pediatric genetic data

The Kids First data resource center is a centralized platform of well-curated clinical and genetic sequence data from dozens of childhood cancer and structural birth defect cohorts, comprising genetic data from thousands of patients and their families. The team will integrate large, disparate data sources, provide support for analyses, and coordinate with third-party data commons and applications.

Researchers can use the platform to probe genetic pathways and explore genetic abnormalities that underlie childhood cancer and structural birth defects. The program will also provide funds to generate new data and facilitate deposition into a centralized database.

According to the UoC, It is important to study these conditions together because children with birth defects are at a higher risk of also developing childhood cancer, suggesting they may share an underlying cause. However, not much is known about these suspected pathways. Few large-scale genetics studies have focused on both childhood cancer and structural birth defects. These shared biological pathways may not be detected if researchers study cancer patients or those with structural birth defects independently.

No comprehensive portal for pediatric cancers

While there is already a healthy web-based research ecosystem for cancer genomics data, “there is no comprehensive portal for pediatric cancers, which are distinct in etiology and genomic profile from adult cancers,” according to background information from the NIH. “There are scant web- or application-based tools to assist researchers investigating the causes and consequences of structural birth defects (or germline diseases in general), and no online resources whatsoever that combine the phenotypic and genotypic information for these two classes of pediatric disease.”

To fill up this void, the Kids First data resource center is designed to bring together the pediatric cancer and structural birth defect research communities, providing a unique opportunity to leverage the information gathered by one group to acquire insights in the other, and to recognize and promote collaborations among the two disciplines.

This data resource portal will serve the needs of four groups of users:
  1. Biomedical researchers, who require deep access to the Kids first data sets and the ability to perform broad integrative queries and analytics across multiple data sets
  2. Clinicians, who require concise summaries of the state of knowledge of pediatric cancers and congenital birth defects
  3. Data scientists, who will build analytic pipelines, knowledge bases and other tools and services on top of the portal
  4. Patients and family, who will look to Kids First as a community resource for learning about their disease, and for finding support groups and other disease-related resources