Dynamic evolution of venom proteins in squamate reptiles. Nicholas R. Casewell, Gavin A. Huttley and Wolfgang Wüster

Similar documents
Dynamic evolution of venom proteins in squamate reptiles

posterior probabilities Values below branches: Maximum Likelihood bootstrap values.

Cover Page. The handle holds various files of this Leiden University dissertation.

Supporting Information

Ch 1.2 Determining How Species Are Related.notebook February 06, 2018

Modern Evolutionary Classification. Lesson Overview. Lesson Overview Modern Evolutionary Classification

CLADISTICS Student Packet SUMMARY Phylogeny Phylogenetic trees/cladograms

Bayesian mixed models and the phylogeny of pitvipers (Viperidae: Serpentes)

OPEN WIDE: DECODING THE SECRETS OF VENOM

Assembling an Arsenal: Origin and Evolution of the Snake Venom Proteome Inferred from Phylogenetic Analysis of Toxin Sequences

TOPIC CLADISTICS

Evidence for Evolution by Natural Selection. Hunting for evolution clues Elementary, my dear, Darwin!

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Testing Phylogenetic Hypotheses with Molecular Data 1

1. Department of Zoology, University of Oxford, South Parks Road, Oxford, OX1 3PS,

Letter. Bryan G. Fry. Results

Species: Panthera pardus Genus: Panthera Family: Felidae Order: Carnivora Class: Mammalia Phylum: Chordata

Warm-Up: Fill in the Blank

Fig Phylogeny & Systematics

17.2 Classification Based on Evolutionary Relationships Organization of all that speciation!

Cladistics (reading and making of cladograms)

Which Came First: The Lizard or the Egg? Robustness in Phylogenetic Reconstruction of Ancestral States

Evolutionary Trade-Offs in Mammalian Sensory Perceptions: Visual Pathways of Bats. By Adam Proctor Mentor: Dr. Emma Teeling

Geo 302D: Age of Dinosaurs LAB 4: Systematics Part 1

Molecular Phylogenetics and Evolution

A Mitochondrial DNA Phylogeny of Extant Species of the Genus Trachemys with Resulting Taxonomic Implications

Department of Biology, University of Central Florida, 4000 Central Florida Blvd, Orlando, Florida , USA 2

History of Lineages. Chapter 11. Jamie Oaks 1. April 11, Kincaid Hall 524. c 2007 Boris Kulikov boris-kulikov.blogspot.

Teaching Workshop: Color Vision in Primates and Other Mammals

Department of Biology, University of Central Florida, 4000 Central Florida Blvd., Orlando, FL 32816, USA 2

Comparing DNA Sequence to Understand

GEODIS 2.0 DOCUMENTATION

Who Cares? The Evolution of Parental Care in Squamate Reptiles. Ben Halliwell Geoffrey While, Tobias Uller

Phylogenetic Affinities of the Rare and Enigmatic Limb-Reduced Anelytropsis (Reptilia: Squamata) as Inferred with Mitochondrial 16S rrna Sequence Data

An investigation of the evolution of the anguimorph lizard venom system

Phylogeny Reconstruction

Comparing DNA Sequences Cladogram Practice

muscles (enhancing biting strength). Possible states: none, one, or two.

UNIT III A. Descent with Modification(Ch19) B. Phylogeny (Ch20) C. Evolution of Populations (Ch21) D. Origin of Species or Speciation (Ch22)

These small issues are easily addressed by small changes in wording, and should in no way delay publication of this first- rate paper.

Ecological Function of Venom in Varanus, with a Compilation of Dietary Records from the Literature

The Making of the Fittest: LESSON STUDENT MATERIALS USING DNA TO EXPLORE LIZARD PHYLOGENY

Phenotype Observed Expected (O-E) 2 (O-E) 2 /E dotted yellow solid yellow dotted blue solid blue

LETTERS. Early evolution of the venom system in lizards and snakes

Modern taxonomy. Building family trees 10/10/2011. Knowing a lot about lots of creatures. Tom Hartman. Systematics includes: 1.

Genes What are they good for? STUDENT HANDOUT. Module 4

Bioinformatics: Investigating Molecular/Biochemical Evidence for Evolution

Interpreting Evolutionary Trees Honors Integrated Science 4 Name Per.

Lecture 11 Wednesday, September 19, 2012

A Role for Genomics in Rattlesnake Research: Current Knowledge and Future Potential

The Role of Geography and Ecological Opportunity in the Diversification of Day Geckos (Phelsuma)

Phylogenetic relationships of horned lizards (Phrynosoma) based on nuclear and mitochondrial data: Evidence for a misleading mitochondrial gene tree

Title: Phylogenetic Methods and Vertebrate Phylogeny

Spot the Difference: Using the domestic cat as a model for the nutritional management of captive cheetahs. Katherine M. Bell

Name: Date: Hour: Fill out the following character matrix. Mark an X if an organism has the trait.

What are taxonomy, classification, and systematics?

Molecular Phylogenetics and Evolution

Genetic divergence of early song discrimination between two young songbird species

Introduction to Cladistic Analysis

Revell et al., Supplementary Appendices 1. These are electronic supplementary appendices to: Revell, L. J., M. A. Johnson, J. A.

Squamate Reptile Genomics and Evolution

Evolution of Birds. Summary:

Evolution as Fact. The figure below shows transitional fossils in the whale lineage.

Evolution of the Bordetella autotransporter Pertactin: identifications of regions subject to positive selection

Phylogeographic assessment of Acanthodactylus boskianus (Reptilia: Lacertidae) based on phylogenetic analysis of mitochondrial DNA.

Freedom of Information

RNA-seq and high-definition mass spectrometry reveal the complex and divergent venoms of two rear-fanged colubrid snakes

INQUIRY & INVESTIGATION

Northern origin and diversification in the central lowlands? Complex phylogeography and taxonomy of widespread day geckos (Phelsuma) from Madagascar

The purpose of this lab was to examine inheritance patters in cats through a

No limbs Eastern glass lizard. Monitor lizard. Iguanas. ANCESTRAL LIZARD (with limbs) Snakes. No limbs. Geckos Pearson Education, Inc.

1 This question is about the evolution, genetics, behaviour and physiology of cats.

COMPARING DNA SEQUENCES TO UNDERSTAND EVOLUTIONARY RELATIONSHIPS WITH BLAST

Supplementary material to Forecasting with the Standardized Self-Perturbed Kalman Filter

8/19/2013. What is convergence? Topic 11: Convergence. What is convergence? What is convergence? What is convergence? What is convergence?

Integrating genomics, testing, and management strategies to control OPP

Reptilian Physiology

2013 Holiday Lectures on Science Medicine in the Genomic Era

Phylogeny of snakes (Serpentes): combining morphological and molecular data in likelihood, Bayesian and parsimony analyses

Studies on the molecular underpinnings of sex determination mechanism evolution and molecular sexing tools in turtles

AP Lab Three: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Yes, heterozygous organisms can pass a dominant allele onto the offspring. Only one dominant allele is needed to have the dominant genotype.

Introduction to phylogenetic trees and tree-thinking Copyright 2005, D. A. Baum (Free use for non-commercial educational pruposes)

ISOB: A Database of Indigenous Snake Species of Bangladesh with respective known venom composition

COMPARING DNA SEQUENCES TO UNDERSTAND EVOLUTIONARY RELATIONSHIPS WITH BLAST

17 Inherited change Exam-style questions. AQA Biology

A Role for Genomics in Rattlesnake Research: Current Knowledge and Future Potential

Mendelian Genetics SI

Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Ecography. Supplementary material

Supplementary Fig. 1: Comparison of chase parameters for focal pack (a-f, n=1119) and for 4 dogs from 3 other packs (g-m, n=107).

Manhattan and quantile-quantile plots (with inflation factors, λ) for across-breed disease phenotypes A) CCLD B)

Evolution in dogs. Megan Elmore CS374 11/16/2010. (thanks to Dan Newburger for many slides' content)

THE venomous snakes are classified in four main

DATA AND DATA INTERPRETATION IN THE STUDY OF LIMB EVOLUTION: A REPLY TO GALIS ET AL. ON THE REEVOLUTION OF DIGITS IN THE LIZARD GENUS BACHIA

recent extinctions disturb path to equilibrium diversity in Caribbean bats

Inheritance of Livershunt in Irish Wolfhounds By Maura Lyons PhD

Introduction. Robert Literman 1 Alexandria Burrett 1 Basanta Bista 1 Nicole Valenzuela 1

Fossils in the Phylogeny of the Isopod Crustaceans

Transcription:

Dynamic evolution of venom proteins in squamate reptiles Nicholas R. Casewell, Gavin A. Huttley and Wolfgang Wüster Supplementary Information Supplementary Figure S1. Phylogeny of the Toxicofera and evolution of the venom system under the single early origin (SEO) hypothesis of Fry et al. [10]. Simplified from Fry et al. [10] and Vidal and Hedges [52]. Key: LJ = lower jaw, UJ = upper jaw, red = venom, blue = non-venom.

Supplementary Figure S2. Bayesian DNA gene tree of the crotamine toxin family. Multiple support values are given at key nodes in the following order: Bayesian DNA, maximum likelihood (ML) DNA, Bayesian amino acid (aa), ML aa. x indicates no support for the node in that analysis. Tips of the tree coloured in red indicate Toxicoferan sequences sourced from the venom gland and blue those sourced from non-venom gland tissues ( physiological non-toxins). Pie charts represent the bpp of ancestral state reconstructions at that node: red = venom, blue = non-venom. The numbered codes for each sequence presented in the genetree represent GenBank GI accession numbers.

Supplementary Figure S3. Bayesian DNA gene tree of the CVF toxin family. Multiple support values are given at key nodes in the following order: Bayesian DNA, ML DNA, Bayesian aa, ML aa. x indicates no support for the node in that analysis. Tips of the tree coloured in red indicate Toxicoferan sequences sourced from the venom gland and blue those sourced from non-venom gland tissues ( physiological non-toxins). Pie charts represent the bpp of ancestral state reconstructions at that node: red = venom, blue = non-venom. The numbered codes for each sequence presented in the genetree represent GenBank GI accession numbers.

Supplementary Figure S4. Bayesian DNA gene tree of the hyaluronidase toxin family. Multiple support values are given at key nodes in the following order: Bayesian DNA, ML DNA, Bayesian aa, ML aa. x indicates no support for the node in that analysis. Tips of the tree coloured in red indicate Toxicoferan sequences sourced from the venom gland and blue those sourced from nonvenom gland tissues ( physiological non-toxins). Pie charts represent the bpp of ancestral state reconstructions at that node: red = venom, blue = non-venom. The numbered codes for each sequence presented in the genetree represent GenBank GI accession numbers.

Supplementary Figure S5. Bayesian DNA gene tree of the NGF toxin family. Multiple support values are given at key nodes in the following order: Bayesian DNA, ML DNA, Bayesian aa, ML aa. x indicates no support for the node in that analysis. Tips of the tree coloured in red indicate Toxicoferan sequences sourced from the venom gland and blue those sourced from non-venom gland tissues ( physiological non-toxins). Pie charts represent the bpp of ancestral state reconstructions at that node: red = venom, blue = non-venom. The numbered codes for each sequence presented in the genetree represent GenBank GI accession numbers.

Supplementary Figure S6. Bayesian DNA gene tree of the veficolin toxin family. Multiple support values are given at key nodes in the following order: Bayesian DNA, ML DNA, Bayesian aa, ML aa. x indicates no support for the node in that analysis. Tips of the tree coloured in red indicate Toxicoferan sequences sourced from the venom gland and blue those sourced from non-venom gland tissues ( physiological non-toxins). Pie charts represent the bpp of ancestral state reconstructions at that node: red = venom, blue = non-venom. The numbered codes for each sequence presented in the genetree represent GenBank GI accession numbers.

Supplementary Table S1. Comparison of Bayes factors generated by Bayesian analysis of codon partitioned and unpartitioned DNA datasets. Codon Unpartitioned: Unpartitioned: Gene family partitioned (H 0 ) mixed model (H A ) model selected (H B ) Bayes factors 2(H 0 H A ) 2(H 0 H B ) Crotamine -2540.18-2614.77-2544.88 149.18 ** 9.40 * CVF -21616.34-22825.84-21617.95 2419.00 *** 3.22 * Cystatin -8041.55-8244.54-8049.09 405.97 *** 15.08 * Hyaluronidase -16704.83-17827.16 N/A 2244.65 *** - Lectin -12743.79-13229.68-12646.24 971.78 *** 4.90 * Kallikrein -25130.34-26321.00-25138.82 2381.32 *** 16.96 * Natriuretic -21010.74-21518.12-21012.66 1014.75 *** 3.84 * NGF -13985.25-14620.33 N/A 1270.17 *** - Veficolin -11122.39-11735.18-11124.01 1225.57 *** 3.24 * Codon partitions utilised are displayed in Supplementary Table S2. Unpartitioned datasets were analysed with: (i) mixed models of sequence evolution (H A ) and (ii) models selected by MrModelTest v2.3 (H B ) [15]. The GTR + I + Γ models was selected by MrModelTest for each gene family except crotamine where GTR + Γ was selected. The marginal log-likelihoods produced by the codon partitioned datasets (H 0 ) were compared with unpartitioned datasets to generate Bayes factors (2logB10) = 2(H 0 H A or H B ). Interpretation of the differences between Bayes factors is taken from Kass and Raftery [40] - *** very strong, ** strong, * positive, NE little to no evidence. In all gene families Bayes factors advocate the use of codon partitioned datasets implementing multiple models of sequence evolution. N/A indicates where codon partitioned models are identical at each position and also the same as the unpartitioned model.

Supplementary Table S2. Estimated models of sequence evolution for DNA and amino acid datasets determined by MrModelTest and ModelGenerator. Data Type Dataset Codon Position Model DNA Crotamine CVF Cystatin Hyaluronidase Kallikrein Lectin Natriuretic NGF Veficolin 1 SYM + Γ 3 HKY + Γ 1 GTR + I + Γ 3 GTR + Γ 1 HKY + Γ 3 HKY + Γ 1 GTR + I + Γ 3 GTR + I + Γ 1 GTR + I + Γ 3 HKY + I + Γ 1 GTR + I + Γ 3 GTR + Γ 1 GTR + Γ 2 GTR + Γ 3 GTR + Γ 1 GTR + I + Γ 3 GTR + I + Γ 1 GTR + Γ 3 GTR + Γ Amino acid Crotamine - WAG + Γ CVF - WAG + Ι + Γ Cystatin - WAG + Γ Hyaluronidase - WAG + Ι + Γ Kallikrein - WAG + Γ Lectin - WAG + Ι + Γ Natriuretic - WAG + Γ NGF - WAG + Ι + Γ Veficolin - WAG + Γ

Supplementary Table S3. Test statistics for positive selection analyses undertaken on non-toxin branches observed in the gene trees Gene family Non-toxin branch Test statistic (lnl) Free parameters Null Alt Null Alt LR Significance cut-off P Crotamine CVF FS3E17002I7SX contig79280 N. naja 213372-1152.425-1152.367 74 77 0.117 0.050 0.990-3789.094-3789.094 74 77 0.000 1.000-3743.629-3742.759 74 77 1.739 0.017 0.628 P. bivittatus contig25850-4194.289-4194.289 74 77 0.000 1.000 Cystatin contig75753 FT7MHCY04JXDK contig09116 contig56006-1090.813-1089.864 72 75 1.896 0.594-1360.246-1356.857 72 75 6.779 0.079-1253.044-1249.724 72 75 6.640 0.010 0.084-1331.481-1329.206 72 75 4.548 0.208 contig66504-1319.006-1317.102 72 75 3.808 0.283 Hyaluronidase Kallikrein Lectin Natriuretic NGF contig26789 contig77142 P. bivittatus contig02483 contig03054 contig10774 B. jararaca 16124242 P. bivittatus contig15144 FT7MHCY03HENJ -5597.587-5595.536 82 85 4.101 0.050 0.251-6004.159-6003.059 84 87 2.199 0.532-5974.993-5972.765 84 87 4.455 0.025 0.216-3740.909-3736.461 81 84 8.895 0.031-3813.828-3806.575 81 84 14.50 6 0.050 0.002-4243.996-4243.163 84 87 1.666 0.050 0.645-1872.165-1872.165 78 81 0.000 1.000-1948.934-1948.934 78 81 0.000 0.025 1.000 Veficolin contig79448 contig06062-2382.297 2380.974 70 73 2.645 0.450-2255.387-2253.109 70 73 4.556 0.025 Positive selection tests were calculated for non-toxin branches observed nested within toxin clades of the gene trees. The probability (P) of the test is calculated using likelihood ratio (LR) tests of the 0.207

test statistic generated from null and alternative (alt) models and three degrees of freedom. Significance of the test is indicated by bold and underline font where the P-value of the test falls below the sequential Bonferroni correction significance cut-off.

Supplementary Table S4. Additional non-venom gland sequences previously identified from snake physiological tissues that were incorporated into the toxin family datasets. Dataset Species Accession number Source of sequence Crotamine CVF Natriuretic Crotalus durissus Naja naja Bothrops jararaca Pseudonaja textilis 14915765 Liver 213372 Liver 16124242 Brain 157169075 Heart

Supplementary Table S5. DNA sequences excluded from phylogenetic analyses due to significant (P<0.05) evidence of recombination by the RDP, GENECONV or Bootscan methods. Dataset Species Accession number Source of sequence Crotamine Lectin Kallikrein Crotalus durissus Enhydris polylepis Enhydris polylepis Crotalus durissus 14915765 Liver 156485255 Venom gland 156485259 Venom gland 76365437 Venom gland