Create your own conference schedule! Click here for full instructions

Abstract Detail


MAST , AUSTIN R [1], Lemmon, Alan [2], Buddenhagen, Chris [3], Lemmon, Emily [3].

Anchored Phylogenomics in Angiosperms: Maximizing Data Combinability Through Coordinated Locus Selection.

Targeted enrichment of DNA libraries prior to next generation sequencing permits the assembly of datasets with 100s or 1000s of putatively orthologous loci from across the genome for sophisticated species tree inference. A potentially transformative goal for the angiosperm phylogenomics community is agreement on a common set of target loci, since it would potentially remove the time-consuming marker development step and maximize data combinability across studies. With collaborators we have identified and validated a candidate set of loci suitable for target enrichment across angiosperms.  We began with the 959 genes previously identified by other researchers as apparently single-copy in the genomes of Arabidopsis, poplar, grape, and rice, identified the corresponding exons in Arabidopsis, found that 3050 of the exons were above a threshold size necessary for enrichment, and narrowed these down further to 1721 exons that are ≥55% similar between Arabidopsis and rice. Using these two taxa as a reference we then identified orthologous regions from 33 complete angiosperm genomes (representing the phylogenetic breadth of angiosperms, including Amborella) and nine low-coverage genomes that we produced for non-model angiosperms. 499 of the exons had an average copy number ≤1.2 and occurrence in ≥85% of the genomes. The sequences for those 499 exons in 26 genomes were used to produce a custom Agilent SureSelect Target Enrichment Kit. We enriched genomic libraries for 50 angiosperms, representing orders from across the phylogeny (Poales, Brassicales, Dipsacales, Caryophyllales, Proteales, and Magnoliales). The kit had an average enrichment success of 93.6% of the targets across the 50 species (range = 82.4–98.2%), and the assembled locus length averaged 903 bases (range = 428–2766) for the 50 species.  Variability in the 499 target regions (the exons; with an average pairwise distance of 75.3%) produced a phylogeny for 91 taxa (the 50 enriched species, the 33 angiosperms with complete genomes used in the earlier step, and nine non-model angiosperm with low-coverage genomes) that was broadly consistent with previously published whole plastid phylogenies when those agreed with previous functional-nuclear-phylogenomics results, but it echoed (mostly) the latter when the previous plastid and nuclear results did not agree.  We will discuss utility of the flanking regions at shallow levels of the phylogeny in the six focal orders if time permits. Variability in the exons were useful for resolving relationships at nearly all levels of divergence, but the considerable variability in the flanking regions made the flanking regions most useful at shallow levels.

Log in to add this item to your schedule

1 - Florida State University, Department Of Biological Science, 319 Stadium Drive, Tallahassee, FL, 32306, USA
2 - Florida State University, Department of Scientific Computing, Tallahassee, FL, 32306, USA
3 - Florida State University, Department of Biological Science, Tallahassee, FL, 32306, USA

Anchored Phylogenomics
Sequence Capture
target enrichment

Presentation Type: Oral Paper:Papers for Topics
Session: 43
Location: Evergreen/Grove
Date: Wednesday, July 30th, 2014
Time: 2:45 PM
Number: 43006
Abstract ID:628
Candidate for Awards:None

Copyright 2000-2013, Botanical Society of America. All rights reserved