Open Peer Review reports
Tumor classification and biomarker discovery based on the 5’isomiR expression level
© The Author(s). 2019
- Published: 7 February 2019
The miRNA isoforms (isomiRs) have been suggested to regulate the same pathways as the canonical miRNA and play an important biological role in miRNA-mediated gene regulation. Recently, a study has demonstrated that the presence or absence of all isomiRs could efficiently discriminate amongst 32 TCGA cancer types. Besides, an effective reduction of distinguishing isomiR features for multiclass tumor discrimination must have a major impact on our understanding of the disease and treatment of cancer.
In this study, we have constructed a combination of the genetic algorithms (GA) with Random Forest (RF) algorithms to detect reliable sets of cancer-associated 5’isomiRs from TCGA isomiR expression data for multiclass tumor classification.
We obtained 100 sets of the optimal predictive features, each of which comprised of 50–5’isomiRs that could effectively classify with an average sensitivity of 92% samples from 32 different tumor types. We calculated the frequency with which a 5’isomiR found in these sets as measuring its importance for tumor classification. Many highly frequent 5’isomiRs with different 5′ loci from canonical miRNAs were detected in these sets, supporting that the isomiRs play a significant role in the multiclass tumor classification. The further functional enrichment analysis showed that the target genes of the 10 most frequently appearing 5’isomiRs were involved in the activity of transcription activator and protein kinase and cell-cell adhesion.
The findings of the present study indicated that the 5’isomiRs might be employed for multiclass tumor classification and the suggested that GA/RF model could perform effective tumor classification by a series of largely independent optimal predictor 5′ isomiR sets.
- Tumor classification
- Genetic algorithm
- Random forest