Results
We recently reported a highly sensitive picogram-scale m
6A RNA immunoprecipitation and sequencing (picoMeRIP-seq) method that can start out with a single mouse oocyte, or a single mouse or zebrafish early embryo (Li et al,
2024). Here, we applied picoMeRIP-seq to human oocytes and early embryos to profile the m
6A landscape and to identify potential regulators during human oocyte-to-embryo transition (OET) (Fig.
EV1A; “Methods”).
First, we demonstrated that picoMeRIP-seq could acquire high-quality data from as few as 10 human embryonic stem cells (hESCs) (Fig.
EV1B–F). As shown in genome browser, a high level of reproducibility for selected gene loci was observed (Fig.
EV1B). Transcriptome-wide profiles of picoMeRIP-seq data from 1000 cells to 10 cells showed strong correlations between replicates and across different input cell numbers (Pearson correlation coefficients >0.8; Fig.
EV1C). A high percentage of genes with m
6A identified in samples of 10 cells showed an overlap with those identified in samples of 1000 cells (>85%), and with those reported in previously published m
6A-seq data (>75%) (Batista et al,
2014) (Fig.
EV1D). In agreement with previous studies (Batista et al,
2014; Dominissini et al,
2012; Meyer et al,
2012), m
6A peaks were predominantly enriched around stop codons (Fig.
EV1E), and the most significant consensus motif within m
6A peaks was GGACU (Fig.
EV1F). These results highlight the efficacy of picoMeRIP-seq in accurately profiling the m
6A methylome from low-input human samples, demonstrating its strong sensitivity and reproducibility.
Next, we performed picoMeRIP-seq on single human oocytes at the germinal vesicle (GV) and metaphase II (MII) stages, and single human embryos at the zygote (1C), two-cell (2C), eight-cell (8C) and blastocyst (BLT) stages, with ≥2 biological replicates for each stage (Table
EV1). Principal component analysis (Fig.
EV1G) and hierarchical clustering analysis (Fig.
1A) of the transcriptome-wide m
6A profiles showed the precise clustering of single oocytes and embryos by cell identity. For the 2C and 8C stages, we had only two biological replicates available. To maintain statistical rigor and ensure comparability between stages, we selected two biological replicates from each stage for downstream analyses. The number of m
6A peaks identified ranged from 5401 to 9601 across the various stages (Fig.
EV1H). We identified 5126 (GV), 4497 (MII), 3666 (1C), 5328 (2C), 4820 (8C) and 5519 (BLT) m
6A-modified (m
6A+) genes (Fig.
1B; Dataset
EV1). Among highly expressed genes (TPM ≥ 10), the percentage of m
6A+ genes ranged from 32 to 44%, aligning with the overall trend observed in the number of m
6A+ genes across the six stages (Fig.
EV1I). Overall, the numbers and the percentage of m
6A+ genes decreased until fertilization and then increased after the zygote stage, the minimal number of m
6A-modified genes at the 1C stage could result from significant maternal RNA degradation before substantial zygotic mRNA synthesis begins, which suggests that m
6A is dynamically regulated during the human preimplantation embryos. This could be credited to the dynamic expression of the m
6A methyltransferases METTL3 and METTL14, the m
6A demethylases FTO and ALKBH5, and m
6A readers at both RNA and protein levels (Yang et al,
2018; Zheng et al,
2023; Zhu et al,
2025) (Figs.
EV1J and
EV2A). Among these m
6A+ genes, the majority were protein-coding genes (82–92%), but there were also long non-coding RNAs (lncRNAs, 5–12%), pseudogenes (2–6%) and a small fraction of other types of RNA species (<1%) (Fig.
EV2B,C). The non-coding m
6A + RNAs identified at each stage in this study may play potential regulatory roles during human oocyte and early embryonic development, as demonstrated by previous study that established the m
6A-associated functions of lncRNAs in human fetal tissues (Xiao et al,
2019). For instance, m
6A has been demonstrated to be required for the function of
XIST (Patil et al,
2016), which is a lncRNA that mediates gene silencing on the X chromosome during mammalian development and was marked by m
6A at the BLT stage (Fig.
1C). The m
6A profiles were clearly detectable as peaks, as illustrated by genome browser snapshots of representative genes, such as oocyte-secreted factor
GDF9 (Paulini and Melo,
2011), the 8C embryo marker gene
LEUTX (Lewin et al,
2023), blastocyst-specifically activated gene
ARID3A (Rhee et al,
2014), and an imprinted gene
ZDBF2 that was constantly expressed across all stages (Duffie et al,
2014) (Fig.
1C). The m
6A peaks typically exhibited significant enrichment near the stop codons (Figs.
1D,E and
EV2D), and displayed the consensus motif GGACU (Figs.
1F and
EV2E) across all stages.
We further explored the extent to which differences in the m
6A methylome during OET are conserved between human and mouse by comparing their transcriptomes and m
6A methylomes from equivalent stages (Wang et al,
2023) (Fig.
2A). To mitigate the influence of RNA abundance on m
6A characterization in the two species, we specifically targeted genes with high expression levels (TPM ≥ 50) at each stage in either species, noting that m
6A+ genes had median TPM around 50 (Fig.
EV3A). For the non-homologous genes between the two species, an average of 39% were m
6A+ across all stages in human, compared to 49% in mouse, with a notably low fraction of m
6A modification at the BLT stage for both human (20%) and mouse (35%) (Fig.
EV3B).
Next, we focused on homologous genes and categorized them into three groups based on their expression at each stage: human-mouse co-expressed (expressed in both human and mouse), human-specifically expressed, and mouse-specifically expressed genes (Fig.
2B; Dataset
EV2; “Methods”). Among human-mouse co-expressed genes, approximately 35% were m
6A+ in both species across various stages, with <11% uniquely modified in human while >17% uniquely modified in mouse (Fig.
2B, middle). Notably, 36% of co-expressed genes at the 8C/2C stage were exclusively modified at the mouse 2C stage versus only 7% at the human 8C stage. Human major zygotic genome activation (ZGA) occurs at the eight-cell stage, whereas in mice it takes place at the two-cell stage. This observed divergence could be attributed to differences in developmental timing or the distinct profiles of major zygotic gene expression, comparisons of human and mouse transcriptomes during the major ZGA show that only half of the transcriptomes overlap, suggesting that the homology between human and mouse zygotic genes is relatively low (Sha et al,
2020). This disparity is further reflected in the genes that are specifically expressed in either human or mouse. Regarding human-specifically expressed genes, over 50% were m
6A+ at the GV, MII and BLT stages, whereas this decreased to 35% at the 8C stage (Fig.
2B, top). In contrast, in mouse-specifically expressed genes, >54% were m
6A+ at every stage, with a peak of 65% at the 2C stage (Fig.
2B, bottom). These data reveal distinct RNA m
6A methylome states during the human major ZGA, likely reflecting evolutionary divergence.
Recent advances in understanding m
6A deposition mechanisms indicate that m
6A acts like a default hardware feature (He et al,
2023; Uzonyi et al,
2023; Yang et al,
2022). These studies found that longer genes tend to exhibit higher m
6A levels, with the exon junction complex (EJC) acting as an m
6A suppressor to shape the m
6A epitranscriptome, protecting RNA near exon junctions within coding sequences from methylation. Also, longer genes typically have more exons, which increases the availability of RRACH (R = G or A; H = A, C or U) motifs in exonic regions. We aim to determine if this finding represents a universal mechanism or context-dependent differences in human early embryonic development. For both human- and mouse-specifically expressed genes, m
6A+ genes exhibited significantly longer gene lengths than unmodified genes across all stages (Figs.
2C and
EV3C). In human-mouse co-expressed genes, m
6A+ genes in both species were substantially longer than others; and interestingly, human homologous m
6A+ genes were significantly shorter than mouse m
6A+ homologous genes specifically at the 2C and 8C stage, respectively (Fig.
2C), a distinction not observed at other stages (Fig.
EV3C). Again, this could be attributed to the unequal cell stage comparison between the mouse 2C and human 8C stages. m
6A+ genes contained markedly more RRACH motifs than unmodified genes across all stages; and for human genes co-expressed in mouse, the genes were uniquely expressed in m
6A + mRNAs in human had lower counts of RRACH motifs than those uniquely expressed in m
6A + mRNAs in mouse, specifically in human 8C embryo, likely due to the unequal cell stage comparison (Figs.
2D and
EV3D). It is important to note that, after normalizing for gene length, there is no significant difference in the density of RRACH motifs (Fig.
EV3E). Instead, this may be more closely related to gene length and splicing events, as supported by recent studies (He et al,
2023; Uzonyi et al,
2023; Yang et al,
2022). These analyses indicate that our data quality is consistent with the latest findings in the field, demonstrating that picoMeRIP is a highly accurate method for RNA m
6A profiling.
Gene ontology (GO) analysis provided functional insights into different groups of genes: of human-specifically expressed genes at the GV, MII, 1C and 8C stages, m
6A+ genes were largely linked to transcription regulation while unmodified genes were mainly enriched in basic metabolic processes; at the BLT stage, human-specifically expressed m
6A+ genes contributed to cell differentiation (Fig.
2E). Additionally, for mouse-specifically expressed m
6A+ genes, divergence in function by stage was observed: they were implicated in the regulation of cell proliferation and multicellular organism development from GV to the 1C stage, and in transcription regulation from GV to the 2C stage (Fig.
EV4A). Across all stages, for human-mouse co-expressed genes, m
6A+ genes were largely associated with transcription, embryonic development as well as development-related signal pathways, while unmodified genes were frequently involved in diverse metabolic pathways (Fig.
EV4B,C). Taken together, these results underscore the conserved characteristics of m
6A modification between the two species. The observed divergences between human 8C and mouse 2C embryos suggest the possibility of species-specific roles for m
6A during the critical window of major ZGA.
We sought to further focus on m
6A profiling of the two critical events during OET: maternal RNA degradation (MD) and ZGA (Fig.
2A). We classified MD genes into two subsets based on their distinct decay patterns: M-decay, degraded during oocyte maturation, exemplified by
VCAN that is associated with human oocyte developmental competence (Shen et al,
2020); and Z-decay, degrading after fertilization, such as
BNC1 that is required for human oocyte meiosis (Zhang et al,
2018) (Fig.
3A,B; Dataset
EV3; “Methods”). In total, we identified 1339 MD genes (M-decay, 396; Z-decay, 943) and 550 ZGA genes (Fig.
3A). The smaller number of m
6A-marked genes detected in this study compared to mouse studies is likely influenced by the larger number of embryos (at least hundreds of embryos at each stage) used in mouse studies, which allows for greater enrichment of low-abundance RNAs (Wang et al,
2023; Wu et al,
2022; Zhu et al,
2023). Near half of ZGA genes were methylated at the 8C stage (Fig.
3C), including
TFAP2C, a known marker of human ground-state naive pluripotency (Pastor et al,
2018) (Fig.
3B). Notably, some MD genes were consistently tagged with m
6A throughout development. We hypothesize that although the original maternal mRNAs are degraded, the zygotic genome re-expressed new mRNAs from the same genes, ensuring continued support for embryonic development (Sha et al,
2020; Yan et al,
2013), consistent with the results in mouse (Wang et al,
2023; Wu et al,
2022). In contrast to M-decay genes (<19%), Z-decay genes (>30%) displayed significantly higher m
6A+ proportions at the GV, MII and 1C stages (Fig.
3C), which can be partially explained by the relatively lower expression of M-decay genes (Fig.
EV5A). For M-decay genes at the GV stage, Z-decay genes at the GV, MII and 1C stages, as well as ZGA genes at the 8C stage, those with m
6A modification had significantly longer gene lengths (Figs.
3D and
EV5B) and more RRACH motifs (Figs.
3E and
EV5C) than unmodified genes. Unmodified MD genes were predominantly associated with metabolic pathways, apoptotic process and protein transport, whereas unmodified ZGA genes were linked to RNA processing and nucleosome assembly (Fig.
3F). Of note, m
6A+ genes from both MD and ZGA functioned in regulation of cell proliferation and pluripotency, likely mediated by key factors essential for transcription (Fig.
3F). Given that m
6A is believed to regulate cell fate in embryonic stem cells by modulating its abundance on transcription factor coding genes (Batista et al,
2014; Geula et al,
2015; Jin et al,
2021), we examined the expression and m
6A modification dynamics of genes encoding transcription factors (Fig.
3G) and transcription cofactors (Fig.
EV5D) throughout all six stages. Among the highly expressed genes (TPM ≥ 50), an average of 68% of transcription factors and 48% of transcription cofactors (Fig.
EV5E) were m
6A+ across all stages, with the highest methylation ratios at the BLT stage. The mature human blastocyst is comprised of the first three cell lineages of the embryo: trophectoderm, epiblast and primitive endoderm (Alberio,
2020; Blakeley et al,
2015; Stirparo et al,
2018). We further examined the m
6A status on the lineage-specific marker genes across developmental stages and observed m
6A occupancy on some master transcription regulators at the 8C and/or BLT stages, such as
GATA2,
GATA3 and
TEAD3 (markers of trophectoderm),
SOX2 and
NANOG (markers of epiblast), and
PDGFRA and
GATA6 (markers of primitive endoderm) (Fig.
EV5F). The prevalence of m
6A deposition on maternal RNAs and the onset of new m
6A methylation during ZGA, particularly on those genes involved in transcription regulation, hint at the involvement of m
6A in these two critical events during human OET.
MicroRNAs (miRNAs), along with m
6A, are another key regulator of RNA stability and translation, and are essential for mammalian development (DeVeale et al,
2021). We and others have observed that m
6A modifications are particularly enriched at miRNA targeting sites in various human and mouse tissues (Liu et al,
2020) as well as in mouse pluripotent stem cells and early embryos (Chen et al,
2015b; Wang et al,
2023). Next, we examined the co-occupancy between m
6A modification and miRNAs on human MD and ZGA genes (Dataset
EV4; “Methods”). miRNAs had a significant tendency to target m
6A + Z-decay genes as compared with unmodified genes at the GV, MII and 1C stages (Figs.
4A and
EV6A,B). However, unlike our observations in mouse data (Chen et al,
2015b; Wang et al,
2023), we did not discern a distinct preference of miRNA targeting between modified and unmodified M-decay genes at the GV stage and ZGA genes at the 8C stage (Figs.
4A and
EV6A,B). Previous studies have demonstrated significant differences in gene expression and miRNA profiles between humans and mice during oocyte maturation and major ZGA. We reasoned that mice and human may have divergent mechanisms for miRNA regulation, which likely explains the distinct miRNA targeting phenomenon observed here (Paloviita et al,
2021; Sha et al,
2020; Yang et al,
2016; Zhao et al,
2020). This discrepancy may suggest species-specific differences in the coordination of m
6A modification and miRNA targeting during OET between humans and mice. However, when examining genes that were constantly expressed across all stages, the preference of miRNA targeting on m
6A+ genes was substantially higher than on unmodified genes (Figs.
4B and
EV6C,D).
m
6A is known to regulate RNA translation efficiency (Jain et al,
2023; Shan et al,
2023; Wang et al,
2015). By correlating previous translatome data in human oocytes and early embryos (Zou et al,
2022) with our m
6A methylome data, we found that m
6A + Z-decay genes had relatively higher translation efficiency than unmodified genes at the MII and 1C stages, while there was no significant association between m
6A modification and mRNA translation efficiency for M-decay and ZGA genes (Fig.
4C; Dataset
EV5; “Methods”). For constantly expressed genes, the median translation efficiencies of m
6A+ genes were higher than unmodified genes across different stages (Fig.
4D). The locations of m
6A deposition along mRNAs are closely related to its distinct roles in translation control (Meyer,
2019b; Shan et al,
2023). We categorized constantly expressed m
6A+ genes based on their modification sites into 5’UTR, coding sequence (CDS), stop codon and 3’UTR. Despite previous reports of a negative correlation between CDS m
6A and translation in other research models (Meyer,
2019b; Shan et al,
2023), our observations showed that genes with CDS m
6A residing had relatively higher translation efficiencies in comparison to those with m
6A at other locations, especially in oocytes (Fig.
4E). The role of CDS m
6A in either promoting or inhibiting protein production remains elusive (Meyer,
2019b). The variations in m
6A associated translation efficiency between MD and ZGA events, as well as the divergent impacts of CDS m
6A on translation between our observation and previous studies, suggest that the effect of m
6A on translation is multifaceted and complex. It possibly involves different m
6A readers to modulate translation positively or negatively under different conditions (Meyer,
2019b; Shan et al,
2023).
Recent findings showed that the m
6A modified transcripts derived from retrotransposons could act both
in cis and, in trans to regulate the chromatin environment, hence, the m
6A modification has been linked to cell fate determination through modulating retrotransposon-derived RNAs (Chelmicki et al,
2021; Chen et al,
2021; Liu et al,
2021; Sun et al,
2023; Xu et al,
2021). Here, we found that m
6A was largely enriched on retrotransposon-derived RNAs during human OET (Fig.
5A; Dataset
EV6; “Methods”). The most recently integrated human endogenous retrovirus H (HERVH) were consistently activated across all stages, and >51% of the expressed HERVH loci/copies were modified by m
6A from the GV to 2C stages (Fig.
5A). HERVK showed marked m
6A modification at the BLT stage. In addition to full-length endogenous retroviruses (ERVs), solo long terminal repeats (LTRs), such as LTR12C and THE1C, were also heavily occupied by m
6A prior to ZGA (Fig.
5A). Certain evolutionarily young subfamilies of LINE-1, including L1HS, L1PA2, and L1PA3, demonstrated their highest m
6A frequencies at the BLT stage. SVA elements also exhibited a high degree of m
6A modification concurrent with ZGA. There was no frequent enrichment of m
6A on Alu elements. Notably, HERVH displayed a broad distribution of m
6A across the internal sequences, while solo THE1C exhibited a clear positional preference for m
6A occupancy (Figs.
5B and
EV6E). Further examination of other types of retrotransposons supports this divergent distribution of m
6A along their entire sequences (Fig.
EV6E). Due to their highly repetitive nature, one of the technical challenges in studying retrotransposons is the difficulty in assigning multi-mapped sequencing reads to unique genomic copies/loci (Lanciano and Cristofari,
2020). Our random assessment strategy for multi-mapped sequencing reads yielded consistent conclusions, ruling out computational biases (Fig.
EV6F,G). These data reveal that m
6A marks retrotransposon-derived RNAs and their stage-specific expression during the OET process, suggesting a potential role in the post-transcriptional regulation of both maternal and ZGA retrotransposon-derived RNAs. It is likely that the widespread distribution of m
6A marks on retrotransposons is largely ‘hard-wired’ by the genomic architecture (He et al,
2023; Uzonyi et al,
2023; Yang et al,
2022).
Human reproduction is regulated by retrotransposons (Macfarlan et al,
2012; Xiang et al,
2022) and studies on the potential intervention and influence of m
6A status on retrotransposons in human embryo development is intriguing. Due to the limited availability of human early embryos and ethical considerations, investigating m
6A status and retrotransposon activity in human early embryos is currently not feasible. Instead, to observe the possible effect of inhibiting m
6A levels of retrotransposon-derived RNAs, we resorted to use hESCs as a model. We treated hESCs with STM2457, a highly potent and selective catalytic inhibitor of the m
6A writer METTL3 (Yankova et al,
2021). As expected, at the transcriptome level, STM2457 treatment led to a considerable reduction of uniquely mapped sequencing reads (>50%) (Fig.
EV6H), and a substantial loss of ~73% in the number of m
6A peaks (Fig.
EV6I). In the context of retrotransposons, this treatment resulted in an around 47% reduction in m
6A peaks associated with retrotransposons (Fig.
5C). The observed variation in reduction rates could be due to differences in transcript structure and the inherent transcriptional regulation of retrotransposons. Retrotransposons, as repetitive elements, may have sequence-specific features that make them less sensitive to m
6A modification machinery inhibitors. Of note, m
6A levels on retrotransposons, including those from LTR (e.g., HERVH, HERVK, HERVL, HERV9NC, MLR1B, MSTA, THE1C) and LINE1 (e.g., L1HS, L1PA2, L1PA3), were significantly diminished after STM2457 treatment (Figs.
5D and
EV7A–C). Following STM2457 treatment, more than half of the genes involved in transcription and metabolic pathways lost m
6A modification, yet their expression levels remained largely unchanged (Fig.
EV7D–F). The retrotransposon stage-specific expression patterns during human OET, combining with the analysis of METTL3 inhibitor experiment collectively highlight the critical role of m
6A in labeling active retrotransposon transcripts.