Genomic repeat abundances contain phylogenetic signal

Steven Dodsworth, Mark W. Chase, Laura J. Kelly, Ilia J. Leitch, Jiři Macas, Petr Novak, Mathieu Piednoel, Hanna Weiss-Schneeweiss, Andrew R. Leitch*

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    27 Downloads (Pure)

    Abstract

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Herewedemonstrate the usefulness of this repetitive fraction inphylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution.

    Original languageEnglish
    Pages (from-to)112-126
    Number of pages15
    JournalSystematic Biology
    Volume64
    Issue number1
    Early online date25 Sept 2014
    DOIs
    Publication statusPublished - 1 Jan 2015

    Keywords

    • continuous characters
    • genomics
    • molecular systematics
    • next-generation sequencing
    • phylogenetics
    • repetitive DNA
    • UKRI
    • NERC
    • NE/G01724/1

    Fingerprint

    Dive into the research topics of 'Genomic repeat abundances contain phylogenetic signal'. Together they form a unique fingerprint.

    Cite this