TY - JOUR
T1 - Family-level sampling of mitochondrial genomes in Coleoptera
T2 - compositional heterogeneity and phylogenetics
AU - Timmermans, Martijn J. T. N.
AU - Barton, Christopher
AU - Haran, Julien
AU - Ahrens, Dirk
AU - Culverwell, C. Lorna
AU - Ollikainen, Alison
AU - Dodsworth, Steven
AU - Foster, Peter G.
AU - Bocak, Ladislav
AU - Vogler, Alfried P.
N1 - Funding Information:
This work was supported by a grant from the Leverhulme Trust to A.P.V., C.B., D.A., and L.B. (grant F/00969/H). M.J.T.N. was supported by an NERC Postdoctoral Fellowship (NE/I021578/1). We thank Junying Lim and Benjamin Linard for assistance with the editing and annotation of the mitochondrial sequences. We are obliged to M. Balke, M. Barclay, M. Bednarik, S. Fabrizi, J. Gomez-Zurita, P. Hammond, R. A. B. Leschen, and C. Murria for donating specimens, and to two anonymous reviewers for valuable comments.
Publisher Copyright:
© The Author(s) 2015.
PY - 2016/1/1
Y1 - 2016/1/1
N2 - Mitochondrial genomes are readily sequenced with recent technology and thus evolutionary lineages can be densely sampled. This permits better phylogenetic estimates and assessment of potential biases resulting from heterogeneity in nucleotide composition and rate of change. We gathered 245 mitochondrial sequences for the Coleoptera representing all 4 suborders, 15 superfamilies of Polyphaga, and altogether 97 families, including 159 newly sequenced full or partial mitogenomes. Compositional heterogeneity greatly affected 3rd codon positions, and to a lesser extent the 1st and 2nd positions, even after RY coding. Heterogeneity also affected the encoded protein sequence, in particular in the nad2, nad4, nad5, and nad6 genes. Credible tree topologies were obtained with the nhPhyML ("nonhomogeneous") algorithm implementing a model for branch-specific equilibrium frequencies. Likelihood searches using RAxML were improved by data partitioning by gene and codon position. Finally, the PhyloBayes software, which allows different substitution processes for amino acid replacement at various sites, produced a tree that best matched known higher level taxa and defined basal relationships in Coleoptera. After rooting with Neuropterida outgroups, suborder relationships were resolved as (Polyphaga (Myxophaga (Archostemata + Adephaga))). The infraorder relationships in Polyphaga were (Scirtiformia (Elateriformia ((Staphyliniformia + Scarabaeiformia) (Bostrichiformia (Cucujiformia))))). Polyphagan superfamilies were recovered as monophyla except Staphylinoidea (paraphyletic for Scarabaeiformia) and Cucujoidea, which can no longer be considered a valid taxon. The study shows that, although compositional heterogeneity is not universal, it cannot be eliminated for some mitochondrial genes, but dense taxon sampling and the use of appropriate Bayesian analyses can still produce robust phylogenetic trees.
AB - Mitochondrial genomes are readily sequenced with recent technology and thus evolutionary lineages can be densely sampled. This permits better phylogenetic estimates and assessment of potential biases resulting from heterogeneity in nucleotide composition and rate of change. We gathered 245 mitochondrial sequences for the Coleoptera representing all 4 suborders, 15 superfamilies of Polyphaga, and altogether 97 families, including 159 newly sequenced full or partial mitogenomes. Compositional heterogeneity greatly affected 3rd codon positions, and to a lesser extent the 1st and 2nd positions, even after RY coding. Heterogeneity also affected the encoded protein sequence, in particular in the nad2, nad4, nad5, and nad6 genes. Credible tree topologies were obtained with the nhPhyML ("nonhomogeneous") algorithm implementing a model for branch-specific equilibrium frequencies. Likelihood searches using RAxML were improved by data partitioning by gene and codon position. Finally, the PhyloBayes software, which allows different substitution processes for amino acid replacement at various sites, produced a tree that best matched known higher level taxa and defined basal relationships in Coleoptera. After rooting with Neuropterida outgroups, suborder relationships were resolved as (Polyphaga (Myxophaga (Archostemata + Adephaga))). The infraorder relationships in Polyphaga were (Scirtiformia (Elateriformia ((Staphyliniformia + Scarabaeiformia) (Bostrichiformia (Cucujiformia))))). Polyphagan superfamilies were recovered as monophyla except Staphylinoidea (paraphyletic for Scarabaeiformia) and Cucujoidea, which can no longer be considered a valid taxon. The study shows that, although compositional heterogeneity is not universal, it cannot be eliminated for some mitochondrial genes, but dense taxon sampling and the use of appropriate Bayesian analyses can still produce robust phylogenetic trees.
KW - long-range PCR
KW - mitogenomes
KW - mixture models
KW - PhyloBayes
KW - rogue taxa
KW - RY coding
UR - http://www.scopus.com/inward/record.url?scp=84992663916&partnerID=8YFLogxK
U2 - 10.1093/gbe/evv241
DO - 10.1093/gbe/evv241
M3 - Article
C2 - 26645679
AN - SCOPUS:84992663916
SN - 1759-6653
VL - 8
SP - 161
EP - 175
JO - Genome Biology and Evolution
JF - Genome Biology and Evolution
IS - 1
ER -