Table of Contents

Exercises: evolutionary tests

The tools proposed in this section should be used on alignments in which we are confident. Statistical analysis that are going to be computed by most of the tools in this section are highly sensible to wrongly aligned regions.

We are going to do here some exercises on the examples proposed by Phylemon for a selection of tools for each sub-section.

Model selection

In this section Phylemon groups programs to help in selecting the model that fits our data best.

jModelTest

Running HYPHY's Model Test

In the front page of the application for running jModelTest on the Phylemon server, the user is asked for a nucleotide sequence alignment input file, and the tree relating the sequences.

Analyzing the results

  1. Which model was selected using the hierarchical LRT scheme?
  2. Which model was selected using the Akaike Information Criterion?
  3. Did they both select the model? If not, why might this have occurred?
    1. How do the AIC scores of the two models compare? Are they largely different? 1)
    2. What happened with the model selected by AIC in the hLRT (see the output file for a record of each step taken in the hLRT)? 2)
  4. Which model would you use for further analysis?

Answers

Adaptation tests

CodeML

Models

Have a look to the examples available for this tools. Here examples are classified by the kind of models the user wants to compute (some models are missing here, more information is available in PAML user guide ):

Exercise

Go to the CodeML's page. And have a look to the examples proposed, that are the one explained above.

  1. Which of this example would you use if you want to have an idea of the different evolutionary rates along you phylogeny?
  2. Which model would be better to see different rates along each site of your alignment? Can this model be used alone to detect positive selection?
  3. Which kind of comparison do we have to do if we want to detect positive selection on a branch that seems to have high value(s) of omega?

Answers

1)
Δi = AICi - minAIC models having Δi within 1-2 of the best model have substantial support and should receive consideration. Models having Δi within 3-7 of the best model have considerably less support, while models with Δi > 10 have essentially no support.