NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE244311 Query DataSets for GSE244311
Status Public on Feb 22, 2024
Title Improved lung cancer classification by employing diverse molecular features of microRNAs
Organism Homo sapiens
Experiment type Non-coding RNA profiling by high throughput sequencing
Summary Lung adenocarcinoma (LUAD) is one of the most common pathological and histological subtypes of primary lung cancer, with high morbidity and mortality. MicroRNAs (miRNAs) are endogenous small non-coding RNAs that regulate the expression of genes at post-transcriptional level. It was reported that A-to-I miRNA editing was decreased in tumors, suggesting the potential value of miRNA editing in cancer classification. However, existing miRNA-based cancer classification models mainly used the frequencies of miRNAs. In order to validate the contribution of miRNA editing information in cancer classification, we extracted three types of miRNA features, including the abundances of original miRNAs, the abundances of edited miRNAs, and the editing levels of miRNA editing sites. Our results show that four classification algorithms selected, i.e., kNN, C4.5, RF and SVM, generally had better performances on all features than on the abundances of miRNAs alone. Since the number of features were large, we used three feature selection (FS) methods to further improve the classification models. One of the FS methods, the DFL algorithm, selected only three features, i.e., the frequencies of hsa-miR-135b-5p, hsa-miR-210-3p and hsa-miR-182 48u (an edited miRNA), from 316 training samples. And all of the four classification algorithms achieved 100% accuracy on these three features for 79 independent testing samples. These results indicate that the additional information of miRNA editing are useful in improving the classification of LUAD samples. And the three miRNAs selected by DFL potentially represent an effective molecular signature for LUAD diagnosis.
 
Overall design Small RNA-Seq for 19 lung adenocarcinoma (LUAD) and 19 adjacent normal tissues were obtained, and put into liquid nitrogen immediately after resection. The total RNAs were retrieved and the small RNA sequencing libraries were prepared and sequenced by BGI (Shenzhen, China). Next, the mutation and editing sites of miRNAs were analyzed with the MiRME algorithm for all of these 38 sRNA-seq profiles and 357 public LUAD and normal samples. Then, abundance of original and edited miRNAs, editing levels of identified miRNA editing sites were obtained for these 395 samples. Four machine learning algorithms were used to classify these samples as LUAD or normal samples. Three Feature Selection algorithms were used to select molecular features that were accurate in predicting the samples.
Web link https://pubmed.ncbi.nlm.nih.gov/38384512/
 
Contributor(s) Zheng Y
Citation(s) 38384512
Submission date Sep 29, 2023
Last update date Feb 22, 2024
Contact name Yun Zheng
E-mail(s) zhengyun5488@gmail.com
Organization name Yunnan Agricultural University
Street address 452 Fengyuan Road
City Kunming
State/province Yunnan
ZIP/Postal code 650201
Country China
 
Platforms (1)
GPL23227 BGISEQ-500 (Homo sapiens)
Samples (38)
GSM7813349 LUAD sample 1 [383-1B]
GSM7813350 LUAD sample 2 [408-1C]
GSM7813351 LUAD sample 3 [411-1C]
Relations
BioProject PRJNA1022393

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE244311_miRNA-freq-38-luad-samples.xlsx 446.9 Kb (ftp)(http) XLSX
SRA Run SelectorHelp
Raw data are available in SRA

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap