Double-stage discretization approaches for biomarker-based bladder cancer survival modeling

Research output: Contribution to journalArticlepeer-review

Abstract

Bioinformatic techniques targeting gene expression data require specific analysis pipelines with the aim of studying properties, adaptation, and disease outcomes in a sample population. Present investigation compared together results of four numerical experiments modeling survival rates from bladder cancer genetic profiles. Research showed that a sequence of two discretization phases produced remarkable results compared to a classic approach employing one discretization of gene expression data. Analysis involving two discretization phases consisted of a primary discretizer followed by refinement or pre-binning input values before the main discretization scheme. Among all tests, the best model encloses a sequence of data transformation to compensate skewness, data discretization phase with class-attribute interdependence maximization algorithm, and final classification by voting feature intervals, a classifier that also provides discrete interval optimization.

Original languageEnglish
Pages (from-to)29-47
Number of pages19
JournalCommunications in Applied and Industrial Mathematics
Volume12
Issue number1
DOIs
Publication statusPublished - 1 Jan 2021

Keywords

  • Bladder cancer
  • Data-driven biomarker research
  • Discretization
  • Genetic expression
  • Machine learning
  • Survival rate modeling

Fingerprint

Dive into the research topics of 'Double-stage discretization approaches for biomarker-based bladder cancer survival modeling'. Together they form a unique fingerprint.

Cite this