Decision tree-based classifiers for lung cancer diagnosis and subtyping using TCGA miRNA expression data.

Abstract	Lung cancer has the world's highest cancer- associated mortality rate, making biomarker discovery for this cancer a pressing issue. Machine learning approaches to identify molecular biomarkers are not as prevalent as screening of potential biomarkers by differential expression analysis. However, several differentially expressed miRNAs involved in cancer have been identified using this approach. The availability of The Cancer Genome Atlas (TCGA) allows the use of machine-learning methods for the molecular profiling of tumors. The present study employed empirical negative control microRNAs (miRs) in lung cancer to normalize lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) datasets from TCGA to model decision trees in order to classify lung cancer status and subtype. The two primary classification models consisted of four miRNAs for lung cancer diagnosis and subtyping. hsa-miR-183 and hsa-miR-135b were used to distinguish lung tumors from normal samples taken from tissues adjacent to the tumor site, and hsa-miR-944 and hsa-miR-205 to further classify the tumors into LUAD and LUSC major subtypes. Specific cancer status classification models were also presented for each subtype.
Authors	Masih Sherafatian, Fateme Arjmand
Journal	Oncology letters (Oncol Lett) Vol. 18 Issue 2 Pg. 2125-2131 (Aug 2019) ISSN: 1792-1074 [Print] Greece
PMID	31423286 (Publication Type: Journal Article)

Join CureHunter, for free Research Interface BASIC access!

Take advantage of free CureHunter research engine access to explore the best drug and treatment options for any disease. Find out why thousands of doctors, pharma researchers and patient activists around the world use CureHunter every day.

Realize the full power of the drug-disease research graph!