A glimpse into sex determination and SD genes evolution in vertebrates with a focus on fish species
The field of evolution of sex determination and SD genes in vertebrates has long been shaped by knowledge gathered in mammals, because previously this was the only group for which the master SD gene, SRY, was known [2]. The generally accepted hypothesis for the mechanism by which an SD gene can evolve, namely by allelic diversification of an autosomal gene towards a male or female sex promoting function, and subsequent maintenance of the male-specifying allele as a male dominant SD gene on the proto-Y chromosome, was satisfied by SRY and its supposed progenitor SOX3 [3]. In 1999, dmrt1 was found as a candidate gene for SD in chicken, which proposed a different mechanism, namely dosage sensitivity [16]. In 2002, the first fish SD gene was identified in the medaka, Oryzias latipes. Unexpectedly, this gene, called dmrt1bY or dmy [7,10], arose by a gene duplication event from the autosomal dmrt1a gene being transposed to the proto-Y, adding another evolutionary mechanism for the emergence of an SD gene. Full sequencing of the male-specific region of the Y-chromosome (MSY), the first to be known after the human Y, revealed that several additional predictions of the general theory of Y-chromosome evolution were also not fulfilled, like stepwise diversification on an autosome, recombination suppression, degeneration of Y-linked genes, and accumulation of malefavoring genes [17]. This finding raised doubts about a unifying concept for the evolution of SD genes and heterogametic sex chromosomes.
Fish are uniquely suited to study the evolution of sex determination and SD genes (Fig. 1). Comprising about half of the 60 000 species of vertebrates, fish show also the greatest variety of sex determination. Unisexuality, simultaneous and consecutive hermaphroditism, environmental, and genetic sex determination are found in different, often closely related species and the distribution of various mechanisms follows no obvious phylogenetic pattern. With respect to genetic SD, it became quickly clear after its first discovery that dmrt1bY of medaka is not the master SD gene of fish in general [14]. Despite huge efforts over more than a decade, SD genes of only a few more species have been added to this list, and all those species have strong monogenic sex determination. In a sister species to medaka, O. luzonensis, and also in sablefish, Anoplopoma fimbria, allelic variations of a TGF-β member gene named gsdfY determines male development [8,13]. In Takifugu rubripes (pufferfish) as well, allelic variation at a single nucleotide position of the amh-receptor 2 gene, again a downstream component of the SD cascade in other vertebrates, controls phenotypic sex [6]. In the Patagonian pejerrey, Odontesthes hatcheri, a Y-linked duplicate of the amh gene drives male sex determination [5], a situation that mirrors the evolutionary scenario of the medaka SD gene.
The same SD gene is also found in the tilapia, Oreochromis niloticus but in that species it is a Y chromosome missense SNP on a tandem duplication of amh that drives sex determination [15]. Interestingly, another TGF-β member, namely gdf6Y has also been found as a potential SD in the killifish Nothobranchius furzeri [11], suggesting that members from the TGF-β pathway are often recruited as SD genes in vertebrates. In rainbow trout, Oncorhynchus mykiss, gene duplication and neofunctionalization led to the emergence of the SD gene [14], but totally unexpected and contrary to the situation in all other vertebrates for which information on proven or candidate SD genes exists, the duplicated gene sdY is not derived from a previously known component of the SD cascade but from the interferon regulatory factor 9, which normally functions in the context of immunity. The current state of knowledge about the evolution of SD in vertebrates – last but not least influenced by the recent findings in fish – has resulted in a very unsatisfying situation. With the examples of male-biasing SD genes identified to date, it is rather impossible to derive a comprehensive understanding of processes and mechanisms that lead to the evolution of new SD genes and to evaluate the prevalence and general relevance of different systems. We do not know whether dosage sensitivity of master SD genes is an enigmatic case that is restricted to the avian lineage or is used more generally. Gene duplication and neofunctionalization is appearing now as a more frequent process to create novel SD genes, but the three fish species in which it has been found might be special in some way and bias our view by chance due to the low number of known SD genes in general. It can also be asked whether teleost fish that experienced in their evolutionary history a whole genome duplication, and certain lineage-specific additional ones, profit from this genomic situation with respect to the evolution of new SD genes. Regarding female heterogamety, there is so far no example for a predicted dominant female determining SD gene in fish that could add to the single case of Xenopus laevis, where dominantnegative action of a W-linked truncated dmrt1 gene is proposed to suppress a postulated default male development in WZ frogs [4]. A totally unresolved complication comes from several reports where even in cases that appear to be clear monogenic SD, polymorphic autosomal modifier genes or environmental influences can modify or override the sex-chromosomal mechanisms. It can only be vaguely hypothesized that these findings indicate transition states between SD switches, and again the prevalence of such situations is absolutely unclear. It is obvious that the first step towards a better understanding of SD mechanisms and how new SD genes evolve is to characterize SD genes in more species, study their inheritance, and eventually illuminate their mode of action
The RAD-Sex approach: a Genome Wide Association Study (GWAS) screening for exploring the diversity of sex determination systems in fish. RAD-tags (Restriction Associated DNA tags) are sequences adjacent to restriction enzyme cutting sites that can be assayed by a next generation sequencing technique known as RAD sequencing method (RAD-seq) [18]. RAD-seq has been shown to be an extremely powerful method for sampling the genomes of multiple individuals in a population. Because it does not require preliminary information on the genetics or the genome of the studied species, this approach is especially interesting for developing genetic analyses in the context of non-model organisms [19]. This technique was recently applied with success to search for sex specific sequences in Gecko lizards [20] and for a sex determining locus in zebrafish, in which strain-specific sex determination systems were characterized [21,22]. To serve as an additional proof of principle, we applied this approach to the medaka, Oryzias latipes, which has a strong well-characterized genetic SD system [7,10,17]. The main idea was to investigate if RAD-seq was equally efficient in identifying sex-specific sequences close to dmrt1bY, the known SD gene in that species [7,10], using as starting biological material individuals from an outbred population of fish and not a genetic mapping family panel as was used in the case of the zebrafish [21]. To that end, fin clips were sampled from 31 phenotypic males and 32 phenotypic females of the Carbio strain of medaka. Genomic DNA was extracted from these samples and RAD library construction was carried out with each individual given a distinct barcode. Samples were sequenced in two lanes of Illumina HiSeq2000 using single end 100 bp reads. This dataset analysis allowed us to characterize more than 600 sex-polymorphic RAD-tag sequences among an overall total of 120 000 genetic markers. Most of these sex-polymorphic sequences map to the central portion of medaka linkage group LG1, the region that distinguishes the Y from the X chromosome (see Fig. 2). Among these sex-polymorphic RAD-tag sequences, 58 were sex-specific, i.e. the sequences were only present in males and totally absent in all the females. When mapped onto the medaka genome, 22 of these sex-specific RAD-tags were found to be on the Y chromosome (LG1) and 30 on unordered scaffolds probably due to an incomplete assembly of the medaka Y chromosome. Among these sex-specific RAD-tag sequences, 9 mapped to the Y non-recombinant sex-specific region (GenBank ID: AP006153) including two sequences that were less than 1 kb downstream of the 5’ end of the dmrt1bY medaka sex determining gene (GenBank ID: AY129241) [10].
These results clearly demonstrate that the RAD-Sex approach is highly efficient because it enabled the identification of sexspecific sequences located very close to the previously known SD gene even in a GWAS that does not allow a simultaneous genetic mapping of the RAD-tags. It should be noted, that our analysis, although conducted on a species with a sequenced genome, did not use the genome sequence for the analysis of sex-associated RAD-tags: the genome sequence only provided validation of the proof of principle. This comparatively inexpensive approach can thus be applied to multiple species in a relatively high throughput project, without the need for obtaining biological materials from controlled genetic crosses that could be difficult to obtain in some species. This approach is also possible in species with no genetic or genomic information available, thus providing a special opportunity to develop an evolutionary-based project that can investigate the sex-determining system in a vast array of species.
Glossary and common abbreviations |