Ancestral duplication of MADS‐box genes in land plants empowered the functional divergence between sporophytes and gametophytes

MADS箱 配子体 功能分歧 孢子体 生物 基因 基因复制 分歧(语言学) 细胞生物学 植物 遗传学 基因家族 基因表达 拟南芥 花粉 突变体 哲学 语言学
作者
Yichun Qiu,Zhen Li,Claudia Köhler
出处
期刊:New Phytologist [Wiley]
卷期号:244 (2): 358-363 被引量:2
标识
DOI:10.1111/nph.20065
摘要

MADS-box transcription factors (TFs) have gained widespread recognition for their exceptional diversity and pivotal roles in various biological functions across eukaryotic organisms. Specifically, in land plants, the MADS-box gene family has undergone substantial expansion and provided the genetic raw material for many developmental novelties, including flowers, fruits and seeds. Therefore, understanding the origin of MADS-box genes is crucial for gaining insights into the evolutionary success of land plants. Land plant MADS-box TFs have been categorized into two groups. MIKC-type (Type II) TFs are named after the structural arrangement of the MADS (M), Intervening (I) and plant-specific Keratin-like (K) domain, followed by a variable C-terminal region. By contrast, M-type (Type I) TFs lack the K domain (Alvarez-Buylla et al., 2000; Kofuji et al., 2003; Nam et al., 2004). The identification of mutants exhibiting distinct flower organ patterns led to the discovery of a multitude of MIKC-type MADS-box TFs in plants (Ng & Yanofsky, 2001; Nam et al., 2003; Kaufmann et al., 2005). By contrast, the identification of M-type genes occurred exclusively through bioinformatic analyses following the unveiling of the Arabidopsis thaliana genome (Alvarez-Buylla et al., 2000). Within the MIKC-type of MADS-box TFs, the MIKC*-type is distinguished based on the different arrangements of exons that encode the K domain (Svensson et al., 2000; Henschel et al., 2002; Zobell et al., 2010; Kwantes et al., 2012; Rümpler et al., 2023). Both the 'classic' MIKCC- and the MIKC*-types are present across land plant lineages and form well-supported separate clades (Kwantes et al., 2012; Gramzow & Theissen, 2013; Liu et al., 2013), suggesting that they diverged before the diversification of land plant lineages (Henschel et al., 2002; Kofuji et al., 2003; Tanabe et al., 2005; Kwantes et al., 2012). It was a long-standing prevailing view in the field that plant Type II genes are orthologs to myocyte enhancer factor-2 (MEF2) genes in animals. Those were presumed to have diverged from plant Type I genes and animal serum response factor (SRF) genes via an ancient duplication before the divergence of the extant eukaryotic lineages (Alvarez-Buylla et al., 2000). Thus, with the shared presence of the K domain, MIKC*-type MADS-box genes were intuitively considered more closely related to MIKCC-type genes, collectively called Type II. This adheres to the principle of parsimony, by which the acquisition of the K domain should ideally have occurred only once during the evolution of plants (Kaufmann et al., 2005; Thangavel & Nayar, 2018) (Fig. 1a). However, we recently showed that both Type I and Type II genes are land plant-specific MEF2 orthologs that arose from a duplication event predating the origin of land plants (Qiu et al., 2023). Therefore, the plant Type I and Type II genes are more closely related than previously appreciated (Fig. 1a). Noticeably, the Type I and II gene duplication hypothetically took place around the time inferred for the rise of MIKC*-type genes (Kofuji et al., 2003; Kwantes et al., 2012). This raises the question whether the previously proposed monophyletic relationship between the MIKC*- and MIKCC-type as the Type II clade is better supported than the alternative hypothesis proposing that MIKC*-type genes are close paralogs of the Type I (M-type) clade (Fig. 1a). Since the ancestral land plant MADS-box gene giving rise to all current subfamilies has been inferred to include the K domain (Kaufmann et al., 2005; Thangavel & Nayar, 2018; Qiu et al., 2023), this domain is not only shared by MIKCC-type and MIKC*-type, but also inherited by the ancestral gene leading to the extant Type I clade. Therefore, MIKCC- and MIKC*-types have just maintained the ancestral structures, which however does not provide sufficient evidence for their close relatedness. Regardless of whichever diverged first, MIKC*-, MIKCC- or M-type, all scenarios equally assume an ancestral gain of the K domain and a subsequent specific loss in the M-type lineage (Fig. 1a). To reappraise the origin of MIKC*-type TFs and to resolve the relatedness among MIKC*-, MIKCC- and M-types, we performed phylogenetic analyses with broad sampling coverage across major lineages of land plants (Supporting Information Table S1). We aligned the MADS domain amino acid sequences of all MADS-box TFs from six bryophytes, four lycophytes, four ferns, five gymnosperms and three angiosperms (Figs 1b, S1; Table S2), with sequences from charophytes, the closest successive sister lineages of land plants, and green algae as outgroups. In the maximum-likelihood tree, the MIKC*-type TFs form a well-supported clade with Type I (M-type) TFs. We additionally generated a maximum-likelihood tree based on the codon alignments, which largely supports the new topology (Fig. S2). Since the MADS-box gene family has undergone several rounds of gene duplications in plants (Nam et al., 2004), there are many redundant sequences in the above analysis, and especially the MIKCC-type sequences are dominating the dataset. We therefore selected representative sequences of comparable sample sizes for the MIKC*-type, MIKCC-type and Type I clades from divergent TF subfamilies in all those species. Phylogenetic trees generated with both maximum-likelihood and Bayesian inference show that MIKC*-type TFs constitute the nearest sister clade to the Type I (M-type) TFs, instead of MIKCC-type (Figs S3, S4). These new phylogenetic analyses all suggest that in a recent streptophytic ancestor of land plants, an MIKC-type gene duplicated into the precursor of the MIKCC clade and a precursor for the MIKC*/M-type clade. This latter gene subsequently duplicated to give rise to the precursor of the MIKC* clade and the precursor of the M-type clade, which arose after the loss of the K-domain exons. We noticed that not all the charophytic MADS-box sequences were positioned as successive sisters to all MADS-box sequences in land plants, but none of the charophytes we analyzed had a pair of MADS-box genes each clearly belonging to Type I or II clades. We thus refrain from concluding whether the charophytic lineages shared the abovementioned one or two rounds of MADS-box gene duplication. If the duplication took place in the early lineage of streptophytes, multiple independent gene losses must have occurred in the paraphyletic charophytes. We focused on the land plant MADS-box TFs and generated maximum-likelihood trees without sequences from charophytes (Fig. S5). With the sequences from chlorophytes as the outgroup, in land plants the MIKC* clade is still closer to the M-type clade rather than the MIKCC clade. We carried out approximate unbiased (AU) tests (Shimodaira, 2002) to compare the phylogenetic trees corresponding to competing hypotheses (Figs S6, S7). One topology represents our new phylogeny. The second one is a constraint phylogeny forcing the Type I clade to be the outgroup of the originally considered combined Type II clade of both MIKCC- and MIKC*-type, reflecting the previous MIKC monophyly model. We also created a third constraint phylogeny resembling an alternative MIKC paraphyly hypothesis, assuming MIKC*-type diverged before the MIKCC-type and Type I split. The AU tests suggest that the new phylogeny with monophyly of MIKC* and M-type is significantly better than the previous proposition of MIKC* and MIKCC monophyly. Although the assumed third topology for MIKCC and M-type monophyly is not completely rejected based on the topology tests, the new phylogeny is substantially better. Resolving the relationship between these clades was challenging, since the two inferred successive duplication events giving rise to the precursors of the three types likely took place in a very short time span in the history of streptophyte evolution, predating the split of bryophytes and vascular plants. This is reflected by low posterior probability supports in the Bayesian inferences (Fig. S4). Nevertheless, combining all available evidence from an up-to-date and comprehensive sampling and multiple analyses, we find the model that MIKCC-type TFs diverged first and the M-type arose by loss of K-domain after the later divergence from the MIKC*-type with the highest probability. The new phylogeny with the revised MIKC* origin aids to reconstruct the model depicting the functional evolution of MADS-box genes in land plants (Fig. 2). As suggested by several charophytic MADS-box genes (Tanabe et al., 2005), and the sole green algal MADS-box gene that has been functionally characterized, CsubMADS1 in the haploid Coccomyxa subellipsoidea (Nayar & Thangavel, 2021), it is likely that ancestral MADS-box genes primarily served gametophytic functions, particularly in stress tolerance and gamete development. MIKC*-type genes likely have maintained the ancestral gametophytic function, consistent with expression and function of MIKC*-type genes in pollen, the male gametophyte (Verelst et al., 2007a,b; Adamczyk & Fernandez, 2009; Liu et al., 2013). Interestingly, the single-copy MIKC* ortholog in the liverwort Marchantia polymorpha MpMADS1 can rescue mutants deficient of MIKC* in Arabidopsis (Zobell et al., 2010), suggesting conservation of MIKC*-type function in male gametophytes across land plants. Supporting this hypothesis, MIKC*-type genes in mosses, lycophytes and ferns are preferentially associated with gametophytes, especially the structures bearing male gametophytes (Svensson et al., 2000; Riese et al., 2005; Zobell et al., 2010; Kwantes et al., 2012). In seed plants, MIKC*-type gene expression became nearly completely restricted to male gametophytes (Verelst et al., 2007a,b; Adamczyk & Fernandez, 2009; Liu et al., 2013; Gramzow et al., 2014; Gu et al., 2022). Similarly, M-type MADS-box TFs are functionally important in female gametophytes in seed plants, as well as in the endosperm in flowering plants (Bemer et al., 2010; Masiero et al., 2011; Qiu & Köhler, 2022), complementing the requirement of MADS-box function after MIKC*-type evolved to be male-specific. It has been proposed that the expansion of the MADS-box gene family contributed to the complexity of the plant body plan (Theissen et al., 1996; Kaufmann et al., 2005; Thangavel & Nayar, 2018). This is most pronounced for the MIKCC clade, which is famous for its regulatory role in the patterning of reproductive organs (Smaczniak et al., 2012; Theissen et al., 2016). Nevertheless, the regulatory role of MIKCC-type TFs in sporophytes is assumed to be derived, since ancestral MADS-box genes likely did not have a sporophytic function. This is inferred by charophytic MADS-box genes, which are not expressed in the zygote, the only diploid phase of charophytes (Tanabe et al., 2005). In comparison with the MIKCC-type subfamily, the copy number of MIKC*-type TFs remained moderately low (Kwantes et al., 2012; Liu et al., 2013). Likewise, the M-type TFs remained less duplicated and differentiated in vascular plants, until they evolved a function in the endosperm of angiosperms (Qiu & Köhler, 2022). Thus, number and differentiation patterns of MIKC* and M-type MADS-box genes align with the simplicity of gametophytes in vascular plants. Conversely, in the gametophyte-dominant bryophytes, the MIKC*-type subfamily largely expanded in several moss species, and the M-type subfamily underwent lineage-specific expansion in the Anthoceros hornworts, which may have contributed to the considerably more complex structures of mosses and hornworts compared with liverworts (Zobell et al., 2010; Table S2). Together, our new results suggest a reclassification that moves MIKC*-type TFs into the Type I clade (Fig. 2). Based on this revised phylogeny, the updated Type I TFs are a clade of MADS-box genes that primarily preserved their ancestral function in gametophyte development. During the evolution of seed plants, MIKC*-type and M-type adopted male- and female-specific functions, respectively. In parallel, the refined Type II clade, comprised of typical MIKCC-type genes, gradually diverged to be the sporophytic MADS-box subfamily, which repeatedly duplicated and neofunctionalized to generate new genetic regulators underlying the diverse body architectures of sporophytes. We extended the collection of genomes analyzed in Qiu et al. (2023) with more species that represent all major lineages of land plants for MADS-box protein identification (Table S1). Amino acid sequences of MADS-box proteins in Arabidopsis (retrieved from TAIR10, https://www.arabidopsis.org/) were used as queries in the Blastp program to search for MADS-box proteins in these additional genomes. The MADS domains from identified MADS-box proteins were extracted based on the alignments to the MADS domain entries in the Conserved Domain Database (Lu et al., 2020) by the conserved domain search tool, CD-Search (Marchler-Bauer & Bryant, 2004). We aligned the MADS domains with Muscle by the default settings (Edgar, 2004). We also used the amino acid alignment as a guide to generate the corresponding codon alignments (Notes S1). We first included all identified MADS-box TFs from a series of representative land plants: Arabidopsis, rice and Amborella trichopoda (angiosperm); Thuja plicata, Cycas panzhihuaensis, Ginkgo biloba, Gnetum luofuense and Welwitschia mirabilis (gymnosperms); Ceratopteris richardii, Salvinia cucullate, Adiantum capillus-veneris and Alsophila spinulosa (ferns); Selaginella moellendorffii, Isoetes taiwanensis, Diphasiastrum complanatum and Lycopodium clavatum (lycophytes); Physcomitrium patens, Ceratodon purpureus, Sphagnum fallax, Takakia lepidozioides, Marchantia polymorpha and Anthoceros angustus (bryophytes) (Table S2). This enormous dataset contains many more MIKCC-type sequences compared with MIKC*- and M-types. Therefore, we generated a downsized dataset with selected sequences in the three major MADS-box TF subfamilies across land plants. For the purpose of elucidating the deep branching pattern between the three major MADS-box clades, we specifically chose the genes with high confidence to be classified into a certain type as the representative sequences based on previous knowledge and the large maximum-likelihood tree generated in this study. We applied Iq-Tree 2 to generate maximum-likelihood trees (Minh et al., 2020). The implemented ModelFinder (Kalyaanamoorthy et al., 2017) determined the JTT substitution matrix (Jones et al., 1992) to be the best substitution model in the tree inference for the large dataset of protein alignments and LG (Le & Gascuel, 2008) for the reduced dataset. For the nucleotide alignments, we allowed different evolutionary rates between partitions of the first, second and third codon positions. We ran 1000 replicates of ultrafast bootstraps to estimate the support for reconstructed branches (Hoang et al., 2018). We also employed Phylobayes (v.3.2) to perform Bayesian inference under the CAT+GTR model with two chains. After ensuring that the two chains had converged with a maxdiff < 0.3, a consensus tree was created. The effective sample sizes of the different parameters were verified to be greater than 200, except for α parameter of the gamma distribution of rates across sites, which was 78 (Lartillot et al., 2009). We further compared the topology of constraint phylogenetic trees fitting the competing hypotheses, by topology tests such as AU tests (Shimodaira, 2002) supported in Iq-Tree 2 (Minh et al., 2020). We thank Dr Jiaqi Wu for the helpful discussion. This research was funded by a grant from the Knut and Alice Wallenberg Foundation (2018-0206) to CK and the Max Planck Society. None declared. YQ, ZL and CK planned and designed the research. YQ and ZL analyzed data. YQ, ZL and CK wrote the manuscript. YQ and ZL contributed equally to this work. The data that support the findings of this study are openly available and resources are listed in Table S1 of this article. Fig. S1 Maximum-likelihood tree based on the amino acid alignments including all MADS-box TFs from a series of representative land plants across all major lineages. Fig. S2 Maximum-likelihood tree based on the codon alignments including all MADS-box TFs from a series of representative land plants across all major lineages. Fig. S3 Maximum-likelihood (ML) tree of selected land plant MADS-box TFs and charophytic, green algal and Cyanophora paradoxa sequences. Fig. S4 Bayesian inference of phylogeny of surveyed MADS domains. Fig. S5 Maximum-likelihood tree of surveyed MADS domains without sequences from charophytes. Fig. S6 Topology tests for phylogenetic trees constraint to certain evolutionary models with all the MADS-box TFs in Fig. 1(b). Fig. S7 Topology tests for phylogenetic trees constraint to certain evolutionary models with the selected MADS-box TFs in Fig. S3. Notes S1 The alignments used in the phylogenetic analyses. Table S1 Species surveyed in this study. Table S2 Land plant MADS-box sequences in the phylogenetic analyses corresponding to Fig. 1(b). Please note: Wiley is not responsible for the content or functionality of any Supporting Information supplied by the authors. Any queries (other than missing material) should be directed to the New Phytologist Central Office. Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI5应助aliu采纳,获得10
1秒前
Yy123发布了新的文献求助10
1秒前
xxx关注了科研通微信公众号
1秒前
yyy发布了新的文献求助10
1秒前
桂花酒酿发布了新的文献求助10
2秒前
3秒前
4秒前
灰灰喵完成签到 ,获得积分10
5秒前
大佬发布了新的文献求助10
5秒前
6秒前
linlin完成签到,获得积分10
6秒前
小方发布了新的文献求助10
9秒前
echo完成签到 ,获得积分10
10秒前
11秒前
12秒前
aliu完成签到,获得积分10
12秒前
今后应助最爱地瓜和虾滑采纳,获得10
14秒前
17秒前
17秒前
18秒前
aliu发布了新的文献求助10
18秒前
南歌子完成签到 ,获得积分10
21秒前
Orange应助XieQinxie采纳,获得10
21秒前
21秒前
jimskylxk发布了新的文献求助10
23秒前
xxx发布了新的文献求助10
24秒前
pan发布了新的文献求助10
25秒前
可爱的函函应助yyy采纳,获得10
29秒前
天天快乐应助pan采纳,获得10
30秒前
安好完成签到,获得积分10
31秒前
文聪发布了新的文献求助10
31秒前
35秒前
热心市民应助科研通管家采纳,获得10
35秒前
科研通AI5应助科研通管家采纳,获得10
35秒前
搜集达人应助科研通管家采纳,获得10
35秒前
情怀应助科研通管家采纳,获得10
35秒前
热心市民应助科研通管家采纳,获得10
35秒前
CipherSage应助科研通管家采纳,获得10
35秒前
深情安青应助夜秋瞳采纳,获得10
36秒前
Weiyu完成签到 ,获得积分10
37秒前
高分求助中
Technologies supporting mass customization of apparel: A pilot project 600
Introduction to Strong Mixing Conditions Volumes 1-3 500
Tip60 complex regulates eggshell formation and oviposition in the white-backed planthopper, providing effective targets for pest control 400
A Field Guide to the Amphibians and Reptiles of Madagascar - Frank Glaw and Miguel Vences - 3rd Edition 400
China Gadabouts: New Frontiers of Humanitarian Nursing, 1941–51 400
The Healthy Socialist Life in Maoist China, 1949–1980 400
Optical and electric properties of monocrystalline synthetic diamond irradiated by neutrons 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3800297
求助须知:如何正确求助?哪些是违规求助? 3345583
关于积分的说明 10325859
捐赠科研通 3062057
什么是DOI,文献DOI怎么找? 1680741
邀请新用户注册赠送积分活动 807201
科研通“疑难数据库(出版商)”最低求助积分说明 763557