Plasma Extrachromosomal Circular DNA As a Potential Diagnostic Biomarker for Nodular Thyroid Disease

CLINICAL AND TRANSLATIONAL MEDICINE(2024)

引用 0|浏览9
摘要
Dear Editor, The accurate diagnosis of cancerous nodules in nodular thyroid disease remains a significant challenge.1, 2 Extrachromosomal circular DNA (eccDNA), generated during apoptosis, shows tissue/plasma-specific patterns, and varying spectra in different diseases. However, its role in distinguishing benign and malignant thyroid nodules is unexplored.3-9 This study leverages Circle-seq technology and machine learning to investigate the potential of eccDNA as a non-invasive biomarker for diagnosing thyroid cancer. To understand the distribution of eccDNA in plasma amongst healthy controls (NOR, N = 13), nodular thyroid goitre patients (NOD, N = 25) and papillary thyroid carcinoma (PTC, N = 47) individuals, we applied an optimised Circle-seq strategy for enriching circular DNA (Figure 1A), with the clinical characteristics of the subjects detailed in Table S1. The majority of the Circle-seq reads were aligned to the human reference genome (Figure S1). We examined the eccDNA profiles in healthy controls and NOD/PTC patients, noting significant variability in eccDNA counts amongst groups. On average, NOR, NOD and PTC groups harboured 27 019 (range: 11 822–45 622), 16 850 (range: 7632–44 537) and 27 352 (range: 4015–83 301) unique eccDNAs, respectively. After normalising eccDNA counts to eccDNA counts per million mapped reads (EPM) to adjust for sequencing depth variations, the PTC group showed significantly elevated EPM values compared to both NOD and healthy controls (Figure 1B). Consistent with increased EPM values in PTC, this group also exhibited a higher number and proportion of eccDNA mapping to protein-coding regions, while repeat element proportions were lower (Figure 1C–E). EccDNA load variations prompted us to explore its relationship with cancer characteristics, revealing that loss of eccDNA from coding regions correlates positively with tumour size, suggesting the specific roles of eccDNA in oncogenesis (Figure S2). The plasma eccDNA analysis in the PTC group showed longer fragments and higher Guanine-Cytosine (GC) content, with most eccDNA populations under 1000 bases, peaking at around 202 and 338 bases, a preference for high GC content areas in eccDNA formation (Figure S3A–C). Genomic annotation highlighted eccDNA's enrichment in untranslated and exonic regions, aligning with known profiles in health and disease (Figure 1F). To understand the distribution of eccDNA genomic locations in PTC patients, we aligned eccDNA sequences with the human genome and identified 450 gene-containing eccDNA (eccGenes) that were significantly prevalent in the PTC group (Figure S4A). Gene ontology (GO) and gene set enrichment analysis (GSEA) indicated that these eccGenes frequently involve exons related to tissue growth, and the Wnt and GPCR signalling pathways (Figure S4B,C). Principal component analysis (PCA) highlighted distinct eccDNA gene region diversity associated with PTC (Figure 1G). Remarkably, an increased presence of miR-1203-related eccDNA circles was detected in the PTC group (Figure 2A,B). Transfection of synthesised miR-1203 eccDNA into thyroid cell lines (TPC-1, BHP10-3 and K1) led to significant transcriptional changes: 572 genes were upregulated and 1035 downregulated (Figure 2C). This significant shift in gene expression, involving numerous cancer-associated genes, underscores eccDNA's influence in oncogenesis (Figure 2D,E). To investigate if PTC tumour cells emit eccDNA into the bloodstream, we crafted mouse xenograft models with human PTC cell lines TPC-1, BHP10-3 and K1. We isolated eccDNA from plasma, confirming human-origin eccDNA in mice (Figure 3A). Sequencing revealed genomic features consistent with tumour-derived eccDNA, including GC content, motif patterns and chromosomal distribution, highlighting its potential as a cancer detection marker (Figure 3B–D). Exploring eccDNA's diagnostic value in PTC, we analysed 308 837 genomic locations. Utilising an E-net logistic regression model and a two-nested leave-one-out strategy,10 we assessed the link between these locations and disease status. We identified 71 critical eccDNA locations capable of differentiating PTC from NOD patients, forming a potential classification model. By comparing 71 eccDNA locations identified in this study with the gene annotations of the human genome GRCh38.p13 in Ensembl (version 108), we found 18 locations showed no overlap with any known genes, while the remaining 53 locations overlapped with 71 known genes (Figure 4A and Table S2). After completing the two-nested leave-one-out cross-validation (LOOCV) process, we generated 72 predictions for the 72 subjects. Receiver operating characteristic curve (ROC) analysis on these predictions and their true labels produced an area under the curve (AUC) of .754 (Figure 4B). To further evaluate the diagnostic efficacy of our model, we have supplemented it with more detailed performance metrics, including an accuracy of 68.1%, sensitivity of 55.3%, specificity of 92.0%, positive predictive value of 92.9% and negative predictive value of 52.3%. We also selected the optimal cut-off point using a two-nested LOOCV approach. To address possible bias from random training and validation set splits, we conducted 2000 rounds of five-fold cross validation, averaging an AUC of .789 with a standard deviation of .135. Additionally, a T-test comparing predictions across the two groups indicated a statistically significant difference (T = −3.9012, p = .00029; Figure 4C), suggesting the 71 identified eccDNA locations could serve as a diagnostic biomarker for distinguishing between PTC and NOD patients. This study explores the potential of plasma eccDNA as a non-invasive biomarker to distinguish between benign and malignant thyroid nodules, potentially improving the diagnosis of PTC. The elevated eccDNA levels found in PTC patients and mouse PTC xenograft models indicate its tumour-originated release into circulation, supporting its diagnostic value. In this study, the sample size is an important limitation, which restricts the exploration of research conclusions and the performance of the model to a certain extent. A small sample size may lead to overfitting and reduce the credibility of research conclusions. To address this, we adopted a two-nested LOOCV loops to ensure that the testing set in the outer loop does not involve repeated training. The results from the outer loop were used as evaluation criteria, maximising data utilisation and preventing overfitting. After our posterior sample size estimation, our sample size meets the minimum sample size requirement, indicating that the research conclusions are credible. In the future, we will expand the sample size to improve our research model and conclusions. Despite limited samples, we identified the nucleosomal origin and genetic predisposition for eccDNA formation in PTC, establishing eccDNA as a promising biomarker for cancer detection and enhancing thyroid cancer diagnostics precision. Jiajun Zhao, Yonglun Luo, Ling Gao and Changbin Yu conceived the idea. Meng Zhou, Zhenyu Yao and Yunyun Xu collected the clinical samples and constructed the mouse xenograft models. Wei Lv and Peng Han performed the Circle-seq experiments. Yonglun Luo, Wei Lv, Meng Zhou, Peng Han, Kai Sun and Ziqian Hao drafted most of the manuscript. Wei Lv, Peng Han and Zhe Xu analysed the eccDNA data. Kai Sun, Ziqian Hao and Meng Zhou encoded the eccDNA sequences and performed machine learning and statistical analysis. Shanshan Shao, Shizhan Ma, Qingling Guo, Haiqing Zhang, Ke Liu, Fan Yang, Zhongshang Yuan, Jiajun Zhao and Guojun Wu have contributed to the execution of the experiments and studies. Meng Zhou, Wei Lv, Peng Han, Kai Sun, Ziqian Hao, Guojun Wu, Changbin Yu, Zhenyu Yao, Ling Gao and Jiajun Zhao discussed the results and contributed to the final manuscript. This work was supported by the National Key Research and Development Program of China (2021YFA0805100), Shandong Provincial Project (ZFJH202306) and the National Natural Science Foundation of China (81430020, 81900717, 82070818 and 82200879). The authors express their gratitude to Prof. Christopher G. Proud and Prof. Xuemin Wang for their valuable comments and suggestions during the writing process. We extend our thanks to Lifang Zhao, Jin Xie, Bo Xiang, Ning Wang, Xiaoqing Sun and Wenbo Bian for their technical assistance. We are also appreciative of Xiuyun Li and Jindi Zhang for the clinical samples collection. Our gratitude goes to Figdraw2.0 (www.figdraw.com) for figures generation. The authors declare no conflicts of interest. Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
0
您的评分 :

暂无评分

数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn