Yıl: 2022 Cilt: 46 Sayı: 2 Sayfa Aralığı: 145 - 161 Metin Dili: İngilizce DOI: 10.3906/biy-2108-83 İndeks Tarihi: 20-06-2022

Developing a label propagation approach for cancer subtype classification problem

Öz:
Cancer is a disease in which abnormal cells grow uncontrollably and invade other tissues. Several types of cancer have various subtypes with different clinical and biological implications. Based on these differences, treatment methods need to be customized. The identification of distinct cancer subtypes is an important problem in bioinformatics, since it can guide future precision medicine applications. In order to design targeted treatments, bioinformatics methods attempt to discover common molecular pathology of different cancer subtypes. Along this line, several computational methods have been proposed to discover cancer subtypes or to stratify cancer into informative subtypes. However, existing works do not consider the sparseness of data (genes having low degrees) and result in an ill-conditioned solution. To address this shortcoming, in this paper, we propose an alternative unsupervised method to stratify cancer patients into subtypes using applied numerical algebra techniques. More specifically, we applied a label propagationbased approach to stratify somatic mutation profiles of colon, head and neck, uterine, bladder, and breast tumors. We evaluated the performance of our method by comparing it to the baseline methods. Extensive experiments demonstrate that our approach highly renders tumor classification tasks by largely outperforming the state-of-the-art unsupervised and supervised approaches.
Anahtar Kelime:

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • Alashwal H, El Halaby M, Crouse JJ, Abdalla A, Moustafa AA (2019). The Application of Unsupervised Clustering Methods to Alzheimer’s Disease. Frontiers in Computational Neuroscience 13: 31. doi: https://doi.org/10.3389/fncom.2019.00031
  • Backstrom L, Leskovec J (2011). Supervised random walks: predicting and recommending links in social networks. In: Proceedings of the fourth ACM international conference on Web search and data mining (WSDM ‘11), NY, USA. p. 635– 644. doi: 10.1145/1935826.1935914
  • Bolshakova N, Azuaje F (2003). Cluster validation techniques for genome expression data. Signal Processing 83: 825-833. doi: 10.1016/S0165-1684(02)00475-9
  • Cai D, He X, Wu X, Han J (2008). Non-negative matrix factorization on manifold. In: 8th IEEE International Conference on Data Mining. p. 63–72.
  • Cai D, He X, Han J, Huang TS (2011). Graph Regularized Nonnegative Matrix Factorization for Data Representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 33 (8): 1548-1560. doi: 10.1109/TPAMI.2010.231
  • Cho A, Shim JE, Kim E, Supek F, Lehner B, et al. (2016). MUFFINN: Cancer gene discovery via network analysis of somatic mutation data. Genome Biology 17 (1): 129. doi: 10.1186/ s13059-016-0989-x
  • Cho DY, Przytycka TM (2013). Dissecting cancer heterogeneity with a probabilistic genotype–phenotype model. Nucleic Acids Research 41: 8011–8020. doi: 10.1093/nar/gkt577
  • Coskun M, Bakir-Gungor B, Koyuturk M (2019). Expanding Label Sets for Graph Convolutional Networks. arXiv preprint arXiv:1912.09575
  • Coskun M, Grama A, Koyuturk M (2018). Indexed Fast Network Proximity Querying. Proceedings of the VLDB Endowment 11 (8): 840-852. doi: 10.14778/3204028.3204029
  • Coşkun M, Koyutürk M (2021). Node Similarity Based Graph Convolution for Link Prediction in Biological Networks. Bioinformatics doi: 10.1093/bioinformatics/btab464
  • Cowen L, Ideker T, Raphael BJ, Sharan R (2017). Network propagation: A universal amplifier of genetic associations. Nature Reviews Genetics 18 (9): 551–62. doi: 10.1038/nrg.2017.38
  • Davies DL, Bouldin DW (1979). A Cluster Separation Measure. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-1(2): 224–27. doi: 10.1109/TPAMI.1979.4766909
  • Hofree M, Shen JP, Carter H, Gross A, Ideker T (2013). Networkbased stratification of tumor mutations. Nature Methods 10 (11): 1108-1115. doi: 10.1038/nmeth.2651
  • Hou JP, Ma J (2014). DawnRank: Discovering personalized driver genes in cancer. Genome Medicine 6 (7): 56. doi: 10.1186/ s13073-014-0056-8
  • Huang JK, Jia T, Carlin DE, Ideker T (2018). pyNBS: a Python implementation for network-based stratification of tumor mutations. Bioinformatics 34 (16):2859-2861. doi: 10.1093/ bioinformatics/bty186
  • Hu F, Wang Q, Yang Z, Zhang Z, Liu X (2020). Network-based identification of biomarkers for colon adenocarcinoma. BMC Cancer 20: 668. doi: 10.1186/s12885-020-07157-w.
  • Jin N, Wu H, Miao Z, Huang Y, Hu Y et al. (2015). Network-based survival-associated module biomarker and its crosstalk with cell death genes in ovarian cancer. Scientific Reports 5: 11566. doi: 10.1038/srep11566
  • Kim YA, Cho DY, Przytycka TM (2016). Understanding GenotypePhenotype Effects in Cancer via Network Approaches. PLoS Computational Biology 12 (3): e1004747. doi: 10.1371/journal. pcbi.1004747
  • Konstantinopoulos PA, Spentzos D, Cannistra SA (2008). Geneexpression profiling in epithelial ovarian cancer. Nature Clinical Practice Oncology 5 (10): 577–87. doi: 10.1038/ ncponc1178.
  • Konstantinopoulos PA, Spentzos D, Karlan BY, Taniguchi T, Fountzilas E et al. (2010). Gene expression profile of BRCAness that correlates with responsiveness to chemotherapy and with outcome in patients with epithelial ovarian cancer. Journal of Clinical Oncology 28 (22): 3555–61. doi: 10.1200/ JCO.2009.27.5719
  • Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI (2014). Machine learning applications in cancer prognosis and prediction. Computational and Structural Biotechnology Journal 13: 8-17. doi: 10.1016/j.csbj.2014.11.005
  • Mateen L, Hinton G (2008). Visualizing data using t-SNE Visualizing Data using t-SNE. Journal of machine learning research 9: 2579- 2605.
  • Monti S, Tamayo P, Mesirov J, Golub T (2003). Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data. Machine Learning 52: 91–118. doi: 10.1023/A:1023949509487
  • Pearson K (1905). The Problem of the Random Walk. Nature 72: 294. doi: 10.1038/072294b0
  • Reis-Filho JS, Pusztai L (2011). Gene expression profiling in breast cancer: classification, prognostication, and prediction. Lancet 378 (9805): 1812-23. doi: 10.1016/S0140-6736(11)61539-0
  • Rohani N, Eslahchi C (2020) Classifying Breast Cancer Molecular Subtypes by Using Deep Clustering Approach. Frontiers in Genetics, 11: 553587. doi: 10.3389/fgene.2020.553587
  • Rousseeuw PJ (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20 (C): 53–65. doi: 10.1016/0377- 0427(87)90125-7
  • The Cancer Genome Atlas Research Network (2013). Integrated genomic characterization of endometrial carcinoma. Nature 474 (7353): 609–615. doi: 10.1038/nature10166
  • Vanunu O, Magger O, Ruppin E, Shlomi T, Sharan R (2010). Associating genes and protein complexes with disease via network propagation. PLoS Computational Biology 6: e1000641. doi: 10.1371/journal.pcbi.1000641
  • World Health Organization (2018). Latest global cancer data: Cancer burden rises to 18.1 million new cases and 9.6 million cancer deaths in 2018. IARC Global Cancer Observatory.
  • Xu H, Gao L, Huang M, Duan R (2020). A network embedding based method for partial multi-omics integration in cancer subtyping. Methods 192: 67-76. doi: 10.1016/j.ymeth.2020.08.001
  • Zhang W, Ma J, Ideker T (2018). Classifying tumors by supervised network propagation. Bioinformatics 34 (13): i484–i493. doi: 10.1093/bioinformatics/bty1072
  • Zhu X, Ghahramani Z (2002). Learning from labeled and unlabeled data with label propagation. Carnegie Mellon University Tech Report.
APA GÜNER ŞAHAN P, Bakir-Gungor B, Coskun M (2022). Developing a label propagation approach for cancer subtype classification problem. , 145 - 161. 10.3906/biy-2108-83
Chicago GÜNER ŞAHAN Pınar,Bakir-Gungor Burcu,Coskun Mustafa Developing a label propagation approach for cancer subtype classification problem. (2022): 145 - 161. 10.3906/biy-2108-83
MLA GÜNER ŞAHAN Pınar,Bakir-Gungor Burcu,Coskun Mustafa Developing a label propagation approach for cancer subtype classification problem. , 2022, ss.145 - 161. 10.3906/biy-2108-83
AMA GÜNER ŞAHAN P,Bakir-Gungor B,Coskun M Developing a label propagation approach for cancer subtype classification problem. . 2022; 145 - 161. 10.3906/biy-2108-83
Vancouver GÜNER ŞAHAN P,Bakir-Gungor B,Coskun M Developing a label propagation approach for cancer subtype classification problem. . 2022; 145 - 161. 10.3906/biy-2108-83
IEEE GÜNER ŞAHAN P,Bakir-Gungor B,Coskun M "Developing a label propagation approach for cancer subtype classification problem." , ss.145 - 161, 2022. 10.3906/biy-2108-83
ISNAD GÜNER ŞAHAN, Pınar vd. "Developing a label propagation approach for cancer subtype classification problem". (2022), 145-161. https://doi.org/10.3906/biy-2108-83
APA GÜNER ŞAHAN P, Bakir-Gungor B, Coskun M (2022). Developing a label propagation approach for cancer subtype classification problem. Turkish Journal of Biology, 46(2), 145 - 161. 10.3906/biy-2108-83
Chicago GÜNER ŞAHAN Pınar,Bakir-Gungor Burcu,Coskun Mustafa Developing a label propagation approach for cancer subtype classification problem. Turkish Journal of Biology 46, no.2 (2022): 145 - 161. 10.3906/biy-2108-83
MLA GÜNER ŞAHAN Pınar,Bakir-Gungor Burcu,Coskun Mustafa Developing a label propagation approach for cancer subtype classification problem. Turkish Journal of Biology, vol.46, no.2, 2022, ss.145 - 161. 10.3906/biy-2108-83
AMA GÜNER ŞAHAN P,Bakir-Gungor B,Coskun M Developing a label propagation approach for cancer subtype classification problem. Turkish Journal of Biology. 2022; 46(2): 145 - 161. 10.3906/biy-2108-83
Vancouver GÜNER ŞAHAN P,Bakir-Gungor B,Coskun M Developing a label propagation approach for cancer subtype classification problem. Turkish Journal of Biology. 2022; 46(2): 145 - 161. 10.3906/biy-2108-83
IEEE GÜNER ŞAHAN P,Bakir-Gungor B,Coskun M "Developing a label propagation approach for cancer subtype classification problem." Turkish Journal of Biology, 46, ss.145 - 161, 2022. 10.3906/biy-2108-83
ISNAD GÜNER ŞAHAN, Pınar vd. "Developing a label propagation approach for cancer subtype classification problem". Turkish Journal of Biology 46/2 (2022), 145-161. https://doi.org/10.3906/biy-2108-83