Use case driven evaluation of open databases for pediatric cancer research

Research output: Contribution to journalArticlepeer-review

Abstract


Background

A plethora of Web resources are available offering information on clinical, pre-clinical, genomic and theoretical aspects of cancer, including not only the comprehensive cancer projects as ICGC and TCGA, but also less-known and more specialized projects on pediatric diseases such as PCGP. However, in case of data on childhood cancer there is very little information openly available. Several web-based resources and tools offer general biomedical data which are not purpose-built, for neither pediatric nor cancer analysis. Additionally, many Web resources on cancer focus on incidence data and statistical social characteristics as well as self-regulating communities.
Methods

We summarize those resources which are open and are considered to support scientific fundamental research, while we address our comparison to 11 identified pediatric cancer-specific resources (5 tools, 6 databases). The evaluation consists of 5 use cases on the example of brain tumor research and covers user-defined search scenarios as well as data mining tasks, also examining interactive visual analysis features.
Results

Web resources differ in terms of information quantity and presentation. Pedican lists an abundance of entries with few selection features. PeCan and PedcBioPortal include visual analysis tools while the latter integrates published and new consortia-based data. UCSC Xena Browser offers an in-depth analysis of genomic data. ICGC data portal provides various features for data analysis and an option to submit own data. Its focus lies on adult Pan-Cancer projects. Pediatric Pan-Cancer datasets are being integrated into PeCan and PedcBioPortal. Comparing information on prominent mutations within glioma discloses well-known, unknown, possible, as well as inapplicable biomarkers. This summary further emphasizes the varying data allocation. Tested tools show advantages and disadvantages, depending on the respective use case scenario, providing inhomogeneous data quantity and information specifics.
Conclusions

Web resources on specific pediatric cancers are less abundant and less-known compared to those offering adult cancer research data. Meanwhile, current efforts of ongoing pediatric data collection and Pan-Cancer projects indicate future opportunities for childhood cancer research, that is greatly needed for both fundamental as well as clinical research.
Original languageEnglish
Article number2
Pages (from-to)2
Number of pages20
JournalBioData Mining
Volume12
Issue number1
DOIs
Publication statusPublished - 15 Jan 2019

Keywords

  • In silico analysis
  • brain tumor
  • cancer database
  • Brain tumor
  • Cancer database
  • Glioma
  • Open research
  • Pediatric oncology
  • Childhood cancer

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computational Mathematics
  • Genetics
  • Molecular Biology
  • Biochemistry
  • Computer Science Applications
  • Computational Theory and Mathematics

Fields of Expertise

  • Information, Communication & Computing

Treatment code (Nähere Zuordnung)

  • Basic - Fundamental (Grundlagenforschung)

Fingerprint

Dive into the research topics of 'Use case driven evaluation of open databases for pediatric cancer research'. Together they form a unique fingerprint.

Cite this