The dark matter of the cancer genome: aberrations in regulatory elements, untranslated regions, splice sites, non‐coding RNA and synonymous mutations
Abstract
Glossary
- Acceptor splice site
- Splice site at the end of an intron (3′ end).
- AU‐rich elements (ARE)
- Conserved motif of adenine/uridine bases in the 3′‐untranslated region (UTR) of an mRNA controlling mRNA decay.
- Branch point
- Sequence within the intron needed during splicing for the creation of the lariat structure. The adenine of the branch point forms a phosphodiester bond with the 5′ end of the intron.
- Cis‐acting element
- A non‐coding sequence in a gene or transcript with regulatory effects on the same or a nearby gene (in cis).
- Consensus splice site
- Nucleotide sequences that serve as splice sites in the majority of premature gene transcription. These include the highly conserved dinucleotides GT (5′ end of intron) and AG (3′ end of intron).
- Cryptic splice site
- Inactive splice site which can be activated when the previous dominant splice site loses its function.
- Donor splice site
- Splice site at the beginning of an intron (5′ end).
- Driver mutation
- Mutation that confers a growth advantage for the tumor leading to malignant initiation, promotion, or progression.
- Epigenetic events
- Events of gene regulation without underlying alterations in the DNA sequence, for example, through DNA methylation or histone modification.
- Enhancer
- Transcription factor binding site located up to 1 Mbp up‐ or downstream of a particular gene with bidirectional effects. The binding of a transcription factor to an enhancer results in the upregulation of the transcription of the respective gene.
- Exon skipping
- Exons are sequences that are usually retained during the splicing process and are part of the mature transcript. Exon skipping denotes a form of alternative splicing in which an exon and its neighboring introns are spliced out, for example, due to mutations in or different strengths of splice sites.
- Insulator
- Genomic region that creates a boundary between an enhancer and neighboring genes. Enhancer‐blocking insulators limit the number of genes which an enhancer can influence through selective disruption of enhancer–promoter interaction.
- Internal ribosome entry site (IRES)
- Alternate ribosomal binding site (RBS) in mRNA, downstream of the classic RBS at the 5′ cap.
- Intron retention
- Introns are sequences that are usually cut out during the splicing process and are not part of the mature transcript. Intron retention denotes a form of alternative splicing in which whole or parts of introns remain in the RNA, for example, due to mutations in or different strengths of splice sites.
- Kozak consensus sequence
- A nucleotide sequence motif in mRNA essential for ribosomal assembly and initiation of translation around the start codon.
- Long non‐coding RNA (lncRNA)
- Long non‐coding RNAs are non‐coding transcripts with a length of > 200 nucleotides and lacking a significant coding potential. LncRNAs affect a variety of cellular functions: they regulate gene expression, influence the activity and localization of proteins or nucleic acids, or act as scaffolds for the formation of cellular substructures and protein complexes.
- microRNA (miRNA)
- Short, non‐coding RNA (18–25 nt) that can repress gene expression at the post‐transcriptional level by binding to mRNAs.
- NCI‐60 Panel
- A panel of the US National Cancer Institute comprising 60 different, well‐characterized human cancer cell lines that is used to test natural and chemical products and serves as a tool in cancer research.
- Passenger mutations
- Mutation that does not promote the fitness of malign cells or even damage them.
- PIWI‐interacting RNAs or piRNAs
- A class of small non‐coding RNAs mainly involved in the silencing of transposable elements (TEs) in germ cells.
- Polyadenylation
- After cleavage of a pre‐mRNA at its 3′‐end to terminate the transcript, roughly 250 adenosines are attached to the mRNA sequence that form the poly(A) tail ensuring translational efficacy and increasing mRNA stability.
- Promoter
- Region of DNA located within the close upstream area of a gene that contains binding sites for specific transcription factors crucial for the initiation of transcription.
- Seed region
- Nucleotides 2–8 of a microRNA largely determining target recognition by usually perfect complementarity to the target mRNA.
- Single nucleotide polymorphism (SNP)
- Single nucleotide variation in the genome that is found in at least 1% of the population.
- Silent mutation
- Base substitution anywhere in the genome without any effect on the amino acid sequence of coding genes, for example, mutations outside of genes or in regulatory elements or synonymous mutations.
- Synonymous mutation
- Base substitution in the coding sequence of a protein‐coding gene that does not modify the amino acid sequence of the gene product due to the redundancy of the genetic code.
- Trans‐acting element
- A factor, usually a protein or oligonucleotide, with regulatory effects on a gene distant from its transcriptional source (in trans).
- Upstream open reading frame (uORF)
- Open reading frame in the 5′‐UTR with regulatory effects on the translation of the main ORF downstream on the same mRNA
Introduction
Regulatory elements
Genetic event | Regulation | Affected gene | Gene function | Alteration | Reference |
---|---|---|---|---|---|
New binding site for activating TF | ↑ | TERT (M) | Catalytic subunit of telomerase | C228T, C250T (promoter) | Bell et al (2015); Heidenreich et al (2014); Horn et al (2013) |
TAL1 (M) | Oncogene, transcription factor | insertion (super‐enhancer) | Mansour et al (2014) | ||
MCL1 (M) | Apoptosis inhibitor | insertion (promoter) | Moshynska et al (2004); Tobin et al (2005) | ||
CCND1 (P) | Oncogene, regulation of cell cycle progression | multiple SNPs (enhancer) | Schodel et al (2012) | ||
MMP1 (P) | MMP | (−1,607) 1G/2G (promoter) | Liu et al (2012) | ||
HGF (P) | Cell proliferation, survival, migration, and morphogenesis | truncation deletion (promoter) | Ma et al (2009b) | ||
LMO1 (P) | Transcription factor | SNP in super‐enhancer | Oldridge et al (2015) | ||
New binding site for repressing TF | ↓ | BRM (P) | Cancer susceptibility gene | insertion (−741, −1,321) (promoter) | Gao et al (2013); Liu et al (2011); Wong et al (2014) |
Disrupted binding site for activating TF | ↓ | SDHD (M) | Tumor suppressor gene, subunit of succinate dehydrogenase complex | 3 hotspots C > T (promoter) | Weinhold et al (2014) |
WDR74 (M) | Cell cycle control, apoptosis | 52 hotspots C > T (promoter) | Weinhold et al (2014) | ||
PAX5 (M) | B cell differentiation factor | multiple mutations (enhancer) | Puente et al (2015) | ||
CK‐19 (M) | Tumor marker (NSCLC) | G (−99)C (promoter) | Fujita et al (2001) | ||
MMP2 (P) | MMP | C (−1,306)T (promoter) | Liu et al (2012) | ||
Disrupted binding site for repressing TF | ↑ | AMACR (P) | Racemase in fat metabolism | germline deletion (promoter) | Zhang et al (2009b) |
Disrupted insulator | ↑/↓ | IGF2/H19 (M) | Proliferation control | germline deletion (insulator) | Ideraabdullah et al (2014); Sparago et al (2004) |
Unknown | ↓ | PLEKHS1 (M) | Largely unknown | 23 hotspots C > T (promoter) | Weinhold et al (2014) |
↓ | CASP8 (P) | Induction of apoptosis | −652 6N del (promoter) | de Martino et al (2013); Li et al (2010); Malik et al (2011); Wang et al (2009) | |
↑ | NFKB1 (P) | Transcription factor | insertion (promoter) | Fan et al (2011); Mohd Suzairi et al (2013); Tang et al (2010); Zhang et al (2009a) | |
↓ | BRCA1 (P) | Tumor suppressor, DNA repair gene | 5‐kb deletion (promoter + 5′‐UTR) | Brown et al (2002) | |
↓ | MMP3 (P) | MMP | (−1,171) 5A/6A (promoter) | Liu et al (2012) | |
↑ | MMP7 (P) | MMP | A (−181)G (promoter) | Liu et al (2012) | |
↑ | MMP9 (P) | MMP | C (−1,562)T (promoter) | Liu et al (2012) |
5′‐Untranslated regions (5′‐UTR)

Gene | Variant | Regulatory element/Mechanism | Effect on protein | Cancer type | Reference |
---|---|---|---|---|---|
CDKN1B | 4‐bp deletion C.‐456‐453del (g) | uORF | Decrease | MEN4 | Occhi et al (2013) |
CDKN2A | G‐34T (g) | Aberrant initiation codon | N/A | Melanoma | Liu et al (1999) |
C‐MYC | C2756T (s) | IRES | Increase | Multiple myeloma | Chappell et al (2000) |
ERCC5 | A25G (SNP) | uORF | Increase | Pediatric ependymoma | Somers et al (2015) |
RAD51 | G135C (SNP) | Splice site/secondary structure | Decrease | Breast cancer | Antoniou et al (2007) |
RB1 | G17C, G18U (SNV, N/A) | Secondary structure | Decrease | Retinoblastoma | Kutchko et al (2015) |
TP53 | C119T (SNP) | IRES | Decrease | Melanoma | Khan et al (2013); Soto et al (2005) |
Synonymous mutations

Mechanism | Examples in cancer | Nucleotide change | Result | Examples in other disease |
---|---|---|---|---|
Splicing | BRCA1/2 (Anczuków et al, 2008; Hansen et al, 2010; Raponi et al, 2011) | BRCA1, 3719 G > T (g) BCRA2, 744 G > A (g) BCRA1, 231 G > T (g) | Exon skipping in vitro Exon skipping Exon skipping | CD44 (Vidal et al, 2009) (osteoporosis) CHRNE (Richard et al, 2007) (myasthenic syndrome) |
APC (Montera et al, 2001; Pećina‐Slaus et al, 2010) | 1869 G > T (g) 5883 G > A (s) | Exon skipping New splice site ? | ||
TP53 (Supek et al, 2014) | Multiple (s) | Multiple | ||
mRNA stability | CYP2D6 (Toscano et al, 2006) | 2939 G > A (s) | Predicted changes in mRNA structure may affect stability | COMT (Nackley et al, 2006) (temporo‐mandibular joint disorder) |
microRNA binding | BCL2L12 (Gartner et al, 2013) | 51 C > T (s) | Loss of has‐miR‐671‐5p binding site | IRGM (Brest et al, 2011) (Crohn's disease) |
Translation/Protein folding | MDR1 (Kimchi‐Sarfaty et al, 2007) | 3435 C > T (s) | Rare codon might lead to changes in cotranslational folding | CFTR (Bartoszewski et al, 2010; Lazrak et al, 2013) (cystic fibrosis) |
Splice sites and introns

3′‐Untranslated regions (3′‐UTR)
Gene | Variant | Mechanism | Expression/effect on protein | Cancer type | Reference |
---|---|---|---|---|---|
CCND1 | Several genomic deletions in 3′‐UTR (N/A) | Premature polyadenylation | Increase by enhanced stability of truncated mRNA (lacking AU‐rich elements, loss of miR‐BS) | Mantle cell lymphoma (oncogenic risk) | Wiestner et al (2007) |
Small aberration within 3′‐UTR (320 bp from stop codon: single base insertion (A at position 1344), small deletion (3 bp at position 1,344–46), duplication in repetitive element in 3′‐UTR (N/A) | Creation of APA signals | ||||
MSH6 | Duplication of 20 bp close to the polyadenylation site (g) | Decreased efficiency of polyadenylation | Decrease | Lynch syndrome | Decorsiere et al (2012) |
TP53 | rs78378222 A/C (g: SNP) | Change within polyadenylation signal | Decrease | Cutaneous basal cell carcinoma, prostate cancer, colorectal adenoma, glioma | Stacey et al (2011) |
PSMD8 TM9SF3 CD59 ANKH CIAO1 SRSF5 MRSP16 NDUFA6 | (N/A) | APA Differential usage of polyadenylation sites | Increase by enhanced stability of truncated mRNA due to miR‐BS loss | Small intestinal neuroendocrine tumor | Rehfeld et al (2014) |
Non‐coding RNAs
Conclusions & outlook
Conflict of interest
Pending issues
Acknowledgements
Footnote
Supporting Information
References
Information & Authors
Information
Published In
Cover: This month's cover highlights the Report “Generation and deposition of Aβ43 by the virtually inactive presenilin‐1 L435F mutant contradicts the presenilin loss‐of‐function hypothesis of Alzheimer's disease” by Benedikt Kretner, Johannes Trambauer, Harald Steiner and colleagues. Cover image shows the immunohistochemical detection of Aβ43 in a hippocampus paraffin section (hilus region) of a FAD case with the PS1 L435F mutation. Cover image by Thomas Arzberger (DZNE – German Center for Neurodegenerative Diseases, Munich, Germany | Department of Psychiatry and Psychotherapy, Ludwig‐Maximilians‐University Munich, Germany | Center for Neuropathology and Prion Research, Ludwig‐Maximilians‐University Munich, Germany)
Submission history
Keywords
Notes
Copyright
Authors
Research Funding
Metrics & Citations
Metrics
Citations
Download Citations
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Select your manager software from the list below and click Download.
Citing Literature
- Phei Er Saw, Erwei Song, Phei Er Saw, Erwei Song, mRNA Vaccines, RNA Therapeutics in Human Diseases, 10.1007/978-981-96-3041-7_26, (617-637), (2025).
- Moawiah M Naffaa, Ola A Al-Ewaidat, Sopiko Gogia, Valiko Begiashvili, Neoantigen-based immunotherapy: advancing precision medicine in cancer and glioblastoma treatment through discovery and innovation, Exploration of Targeted Anti-tumor Therapy, 10.37349/etat.2025.1002313, 6, (2025).
- Xuefei Liao, Li Yang, Mingjuan Jiang, Yujie Xin, Huirong Yan, Qingshuang Qin, Mengdi Chen, Jianhong Lu, The Emerging Roles of Alternative Splicing in Human Oncovirus Infection, Journal of Medical Virology, 10.1002/jmv.70346, 97, 4, (2025).
- Bárbara dos Santos Dias, Symara Rodrigues Antunes, Danilo do Rosário Pinheiro, Rommel Mario Rodriguez Burbano, Bárbara do Nascimento Borges, Mitochondrial Complex I Molecular Alterations in Sapajus apella as a Human Gastric Carcinogenesis Model After MNU Exposure , Journal of Medical Primatology, 10.1111/jmp.70017, 54, 2, (2025).
- Yimin Zhu, Jiayu Wang, Binghe Xu, Development of a prognostic model based on the ceRNA network in Triple-Negative Breast cancer, PeerJ, 10.7717/peerj.19063, 13, (e19063), (2025).
- Jagriti Pal, Marisa Riester, Athina Ganner, Avantika Ghosh, Sonam Dhamija, Debdatto Mookherjee, Christian Voss, Ian J. Frew, Fruzsina Kotsis, Elke Neumann-Haefelin, Anne Spang, Sven Diederichs, Nonstop mutations cause loss of renal tumor suppressor proteins VHL and BAP1 and affect multiple stages of protein translation , Science Advances, 10.1126/sciadv.adr6375, 11, 7, (2025).
- Xuemei Lv, Xiaoyu Sun, Yang Gao, Xinyue Song, Xiaoyun Hu, Lang Gong, Li Han, Miao He, Minjie Wei, Targeting RNA splicing modulation: new perspectives for anticancer strategy?, Journal of Experimental & Clinical Cancer Research, 10.1186/s13046-025-03279-w, 44, 1, (2025).
- Si-Yun Lin, Hou Huang, Jin-Jie Yu, Feng Su, Tian Jiang, Shao-Yuan Zhang, Lu Lv, Tao Long, Hui-Wen Pan, Jun-Qing Qi, Qiang Zhou, Wei-Feng Tang, Guo-Wen Ding, Li-Ming Wang, Li-Jie Tan, Jun Yin, Activin A receptor type 1C single nucleotide polymorphisms associated with esophageal squamous cell carcinoma risk in Chinese population, World Journal of Gastrointestinal Oncology, 10.4251/wjgo.v17.i1.96702, 17, 1, (2025).
- Hikaru Kurasawa, Yuta Matsuura, Riho Yamane, Tomoyuki Ohno, Yasunori Aizawa, Biallelic genome engineering to create isogenic induced pluripotent stem cells modelling Huntington’s disease, Genes & Genetic Systems, 10.1266/ggs.25-00016, (2025).
- Anikó Bozsik, Henriett Butz, Vince Kornél Grolmusz, Tímea Pócza, Attila Patócs, János Papp, Spectrum and genotyping strategies of “dark” genetic matter in germline susceptibility genes of tumor syndromes, Critical Reviews in Oncology/Hematology, 10.1016/j.critrevonc.2024.104549, 205, (104549), (2025).
- See more