The diversity of fungal genome

The genome size of an organism varies from species to species. The C-value paradox enigma is a very complex puzzle with regards to vast diversity in genome sizes in eukaryotes. Here we reported the detailed genomic information of 172 fungal species among different fungal genomes and found that fungal genomes are very diverse in nature. In fungi, the diversity of genomes varies from 8.97 Mb to 177.57 Mb. The average genome sizes of Ascomycota and Basidiomycota fungi are 36.91 and 46.48 Mb respectively. But higher genome size is observed in Oomycota (74.85 Mb) species, a lineage of fungus-like eukaryotic microorganisms. The average coding genes of Oomycota species are almost doubled than that of Acomycota and Basidiomycota fungus.


Introduction
Fungi are the larger group of eukaryotic organisms that ranges from yeast and slime molds to mushrooms. These organisms are majorly classified as monophyletic Eumycota group and their diversity ranges from 500 thousand to 9.9 million spanning over 1 billion years of evolutionary history [1,2]. They are abundant at worldwide scale due to their small size and their cryptic lifestyle in soil, dead and decomposing matter, as symbionts with algae, fungi, bryophyte, pteridophyte, higher plants and animals [3][4][5][6][7]. These organisms dominate earth from polar to temperate and tropical habitats [8][9][10]. Due to their ecological dominance, they play a central role in human endeavor. The fungus (mushroom and truffle) are directly used as human food and yeasts are used in bread industry. The fungi also carry out nutrient cycling by decomposing organic matter [11][12][13]. They also produce antibiotics, enzymes, mycotoxins, alkaloids, polyketides and other chemical compounds [14][15][16][17][18][19][20][21].
The kingdom fungi are classified into several major phyla namely Ascomycota, Basidiomycota, Chytridiomycota, Monoblepharidomycota, Neocallimastigomycota, Blastocladiomycota, Glomeromycota, Entomophthoromycota, Stramenopiles and Micorsporidia and sub-phyla namely Kickxellomycotina, mucoromycotina and Zoopagomycotina [22,23]. The diverse ecological dominance of fungus makes them important from an evolutionary point of view. That is why fungi are subjected to intense phylogenetic, ecological and molecular studies. The advancement in high throughput sequencing technology progressed rapidly that led to sequencing of large numbers of fungal genomes. The evolution of biological diversity raises several questions such as how much variation can be expected among closely or related genomes. This can be answered by the comparing closely related genomes. So we carried out a global search of fungal genomes in MycoCosm and JGI database and studied the evolutionary relationships of their genome sizes and reported here [24][25][26].

Fungal genome size
Recently, the genome sequencing technology has emerged as one of the most efficient tools that can provide whole information of a genome in a small period of time. Since the completion of genome sequencing of the model fungus S. cerevisae in 1996, sequencing of large numbers of fungal genomes are now completed. Sequencing of large numbers of fungal genomes will allow us to understand the diversity of genes encoding enzymes, and pathways that produces several novel compounds [24]. Although the fungi are very diverse in nature, their basic cellular physiology and genetics shares some common components with plants and animal cells. These include multicellularity, cytoskeletal structures, cell cycle, circadian rhythm, intercellular signaling, sexual reproduction, development and differentiation [27]. It was previously thought that genomes of all fungi are derived from the genome of the model fungi Saccharomyces cerevisae [27]. However recent explosion in fungal genome sequencing greatly expanded the fungal genomics and molecular diversity of these organisms. Compared to the genome size of animals and plants, the genome sizes of fungi are small [28]. The genome size of model fungi S. cerevisae is bit more than 12 Mb (Table 1). From the studied 172 fungal species, only seven species have genome sizes larger than 100 Mb (Table 1). So, the probability of occurrence of larger genomes in fungi is very small. The genome size of Cenococcum geophilum (177.57 Mb) is the largest and the genome size of Hansenula polymorpha (8.97 Mb) is the smallest from the studied species. Both species belong to Ascomycota. In the group of Basidiomycota species, the genome size of Wallemia sebi (9.82 Mb) is the smallest one and genome of Dendrothele bispora (130.65) is the largest one (  (Table 2). If we consider about the coding gene sequence in fungi, in average the Acomycota, Basidiomycota, Oomycota and Mucoromycotina groups encodes for 11129. 45, 15431.51, 24173.33, 13306 no. of genes respectively in their genomes ( Table 2).
The comparative analysis of fungal genomes show fungi are very divergent [27]. It was earlier thought that genomes of Magnaporthe grisea and Neurospora crassa share a common ancestor. But, comparative genomes analyses revealed only 47% amino acid sequence identity and absence of conserved synteny [27]. Only few genes are identified to be in conserved co-linearity. This shows that even members of the same genus can show remarkable divergence at the genomic level. A genomic comparison between Aspergillus nidulans, Aspergillus fumigatus and Aspergillus oryzae shows only 68% of amino acid sequence identity [27]. The genome duplication and translocation have major impact in evolution in yeast ( Figure 1) [29,30]. The whole genome duplication in yeast followed by massive gene loss is confirmed by comparative experimental analysis [31,32]. This indicates that fungal genomes are very dynamic in nature. Lavergne et al. [33] reported that genome size reduction can trigger rapid phenotypic evolution in invasive plants. Their report suggests that the invasive genotypes had smaller genomes. Smaller genome sizes have phenotypic effects that increased the invasive potential [33]. But in exception, for example, the duckweeds which are smallest, fast-growing and simplest flowering plants are invasive in nature and contains increased DNA content in their genomes [34].

Evolution in genome size
Genomes are aggregates of genes and this concept nicely fits with the prokaryotic organisms and viruses [35]. This concept is very inappropriate for eukaryotic organisms as most of the eukaryotic genomes are studded with nongenic and unconstrained repetitive DNA. This can lead to approximately 200,000 fold variation in genome size [36]. The genome size of an organism depends on the particular developmental and ecological need of the organism [37]. The genes are made up of DNA and it is a general assumption that more complex organisms requires more genes and thus contain more DNA in its genomes. The simple organisms probably contain fewer essential genes compared to more complex organisms and thus contain less DNA in its genomes. However this observation is not true. Some very simple organisms could have more DNA content than complex multicellular organisms. For example, some amoeba species have 200 times more DNA than humans [38]. Similarly, lilies have 200 times more DNA than that of rice [39]. But in many organisms much of the DNA content is noncoding and repetitive. But it is very important to understand which evolutionary forces produces enormous amount of noncoding DNA? What are the adaptive functions of these nongenic DNA? If these nongenic DNA don't have any essential adaptive roles, than why natural selection favors the burden of synthesis of extra DNA? Several hypotheses are postulated since long days to address these questions. But still there is debate over it. Some of the hypotheses are discussed later. From the studied fungal genomes, the average genome sizes of Oomycota species (74.85 Mb) are higher than other. The Ascomycota and Mucoromycotina species shares more or less than same average genome size i.e. 36.91 and 37.02 Mb, respectively. In contrary, the average genome sizes of Basidiomycota species is 46.48 Mbs. The increase in genome size in Oomycota species is also directly correlated with the increase in the numbers of average coding gene sequences. The average numbers of coding genes present in Oomycota species are 24173.33 genes per genome which is almost the double number present in Ascomycota and Basidiomycota species.

The adaptive theories of genome evolution
If certain numbers of genes are responsible for the phenotype and genotypic characters of an organism, why there are extra amounts of DNA in its genome? The adaptive theory explains that this extra DNA abundance is for adaptive function and its content don't have any significant effects in phenotype of the organism [40]. A large genome directly increases the nuclear and cellular volumes [41]. This largely helps to buffer the fluctuation in the concentration of regulatory proteins or protect coding DNA from spontaneous mutation [42]. So the variation in the genome size is due to adaptive needs or due to natural selection in different organisms [37].

Junk DNA theory of genome evolution
The junk DNA hypothesis suggests that these extra DNAs are useless, maladaptive DNAs and fixed by random drift [43]. These DNAs are carried in chromosome and don't have any significant role in the phenotype of an organism [43]. These junk DNAs are known as parasitic DNA or transposable elements (TEs) [44]. The mutational mechanisms of DNA gain or loss can lead to minor changes in the genome of an organism, but changes in genome size may occur by the involvement of different evolutionary forces [37]. An increase in transposition rate certainly can lead to an increase in genome size [37]. Instead of thinking in genome size evolution by adaptive evolution theory or by junk DNA theory, it is very important to understand which evolutionary force is responsible for changes in genome size. The mutational and selective forces might have vast potential to affect the change in genome sizes ( Figure 1) [37]. If we can get the specific clue, we can try to estimate the strength of individual force and whether the magnitude of individual force may produce changes in genome sizes. This approach can explain the quantitative sense about genetic mechanism and the selective forces that affect the genome size. The activities of transposable elements are very fast and can able to amplify the a transposable copy number into 20-100 copies (~0.1-1 Mbp) in a single generation [45,46]. The changes in genome size through spontaneous deletion or insertion are relatively slow [47]. For example, the  Figure 1 Role of different forces affecting the evolution of genome size. The major important factors are transposable elements (TEs), short sequence repeats, microsatellites, genome duplication and others. The mutational and selection pressure plays a significant role in this process. The negligible selective effects governed by genetic drift also contribute for the evolution of genome size. Overall all the forces play a role towards the increase in genome size at different levels. The photograph is adapted according to the report of Petrov [37].
Drosophila melanogaster genome losses less than a single base pair per generation [47]. If there is strong selection in increasing in gnome sizes, strong mutational pressure also can not affect the evolution of genome size [37]. However, strong selection for increase in genome size can substantially slow down the impact of mutation rate. If we can get the information of time scale of genome size divergence, then we can infer the genome-size changes between two closely related organisms. If we will consider the evolutionary development of fungus, Ascomycota has higher evolutionary rate than Basidiomycota [48]. But when we compared the average genome size of Ascomycota, Basidiomycota and Mucoromycotina, we found that the genome size of Basidiomycota is larger than the genome size of Ascomycota and Mucoromycotina. This may suggests that the evolution of fungal genome size is due to addition of nucleotides/DNA contents rather than deletion of nucleotides. Some forces act on the traits correlated with total genome size of an organism [37]. In this case, natural selection forces affect only to few genomic components. For example, the increase in rate of heterochromatin shrinkage through heterochromatic DNA should not affect the size of euchromatin [37]. Similarly, the expansion in satellite DNA should not hamper the size of satellite free sequences. Another important question is that, whether different genomic components are varying together in a correlated fashion during evolution of genome size? Although there are no significant current evidences regarding this question, there are certain cytogenetic and molecular studies available. The cytogenetic study revealed that genome size differences are scattered throughout the euchromatic portion of the genome [49][50][51][52]. Comparison of orthologous introns revealed correlation between average size of intron and genome size [53]. The changes in the intron length do not account for the changes in the genome size. Although transposable elements are largely associated with the increase in genome size, presence of increased simple repeated sequences, pseudogenes, increased size of inter-enhancer spacers and microsatellites are also associated with increase in genome size (Figure 1) [54][55][56]. When there are changes in genome size, they do it across all the genomic components. This suggests that a global force acts as the direct agent for change in genome size. So, from our study we can speculate that Oomycota species might harbors high densities of TEs, simple repeat sequences, microsatellites and pseudogenes. Similarly, the Basidiomycota species might have more densities of TEs, simple repeat sequences, microsatellites and pseudogenes compared to Ascomycota and other groups. Whitney et al. [57] reported about the nonadaptive process in plant genome size evolution. They hypothesized that genome expansion is maladaptive and lineages with small effective population size evolve larger genomes than those with large population size. In addition, mating systems are likely to affect genome size evolution via population size and spread of transposable elements [57].

Conclusion
The question of genome size (C-value paradox) is very puzzling. Most probably we can better understand about the evolution of fungal genome size by completely understanding the roles of noncoding DNAs. It is also equally important to understand whether the addition and deletion of additional DNA content varies between species to species and at organism level too. Although experimental approach like cytogenetic study of euchromatic region can give some lime light about this issue, still high fidelity experimental approaches are lacking till to date.