If we have run through the Model Comparison section we will have evaluated the fit of substitution models on a multiple sequence alignment. Now that we have an idea of which models best explain the evolution of our gene/protein throughout a given sampled, we would like to calculate the distances among sequences. This is one of the first steps in many types of phylogenetic analyses.
If we are working with DNA data we have to use DnaDist tool (otherwise we should use ProtDist).
DnaDist
.
Here we want to quickly build a phylogeny from an alignement. For the purpose of this exercise we are going to work with the PhyML example: “NJ optimized
”.
In a first step, load the example:
The first point to notice is that, in contrast with some other tools, PhyML allows user to load DNA or protein sequences. This feature is great but users often miss to select the correct option.