We did not implemented a direct way to build phylogenies from a single pipeline of programs. But each user can easily build its own using the Pipeliner. Example 2 is a good starting point, you can keep both branches of the pipeline or remove one, in order to retrieve your results faster (remember once done you can save your pipeline and reuse it at any time).
More help about Pipeliner is available here.
The First important thing you have to be aware of when working with sequences is to avoid using complex text editors (eg: Microsoft word) when manipulate your files.
Complex text editors usually insert extra hidden characters containing format information, you will be able to get rid of them by using one of those more “simple” text editors
If you are using PHYLIP format (sequential or interleaved), make sure that the names of your sequences do not contain blank characters. Try to rename them with underscores.
YES, with CDS-ProtAl tool you will be able to translate your sequences using one of the available genetic code (see the Exercises on utilities section).
YES, with CDS-ProtAl tool, run the example proposed by Phylemon and have a look at the third Output result of this utility.
YES, some of the members of the Phylemon team are organizing the Course on Molecular Evolution, Phylogenetics and Adaptation, in Oeiras (Portugal), Cambridge (UK), Valencia (Spain). Perhaps it is in your interest to apply in one of them.
Our purpose when asking you to enter an email, is just in case one of those two cases append:
Any case, if you are not interested in receiving those notifications, you can easily fill the form with a fake e-mail address.
In order to protect our server from overload and users jobs from being lost, all jobs in Phylemon are send to a queuing system (see Oracle Grid Engine) that will distribute them in our cluster. In case all our “nodes” are already working, your jobs will be queued in chronological order (queue time hardly raises more than few minutes).
If you have a tree in newick format and you want transform it, you can use tree viewer/editor ETE. Go to the ETE form through the Utilities section, upload your tree and click on run.
Once loaded you can easily transform your tree by:
Once finished you can download the resulting tree by clicking on the link “outtree.nw“. This version of the tree will also be accessible in the tab “Job generated” of the upload form, in case you want to use this tree as input for an other tool.
It can be that some times MrBayes returns a segmentation fault, in the very last steps of an analysis. Usually Phylemon is not able to detect this and keep saying “Still running”. What you can do in order to take advantage of this previous run, is to download the job (go to the end of the result page, expand the “Othe actions” and download the job). Have a look to the tree files, and select the last tree of one of them (you can select the one with the better likelihood by comparing all files). And finally rerun your same job but adding in you MrBayes block the startingtree parameter like this:
#Nexus begin data; dimensions ntax=5 nchar=8; [dimensions ntax=6 nchar=8;] format datatype=DNA gap=- missing=?; matrix a ATGCATGC b ATCCATGC c ATCAATGA d ATTCTTCC e ATCAATGA [f ATCAGTCC] ; end; [begin mrbayes; delete f; end;] begin trees; tree startingtree = (a:2.56,((b:0.36,c:0.42):0.09,(e:0.23,d:0.87):0.66)); end; begin mrbayes; startvals tau=startingtree; startvals V=startingtree; mcmcp nperts=2; mcmcp ngen=10000; mcmc; end;
Example 23 in CodeML represents exactly what you want to do, the complete output corresponding to the estimate of selective pressure among sites according to the BEB approach would be found at the end of the rst file.
Note that, in order to accept these values, you would need first to accept that the assumption made by the M8 model that some sites are evolving at omega rate > 1 is correct. This can be achieved with a likelihood ratio test (LRT) between M7 and M8 models.
Summarizing, you need:
Some other adjustment in the methodology may also be done (I recommend you to have a look to CodeML user guide, available through the ”help
” link in Phylemon's Codeml page) in order to be sure that the estimate of the likelihood of each model is the best. You can for example run each model several times (2 or 3 times) with different starting values of omega, and keep as valid the result reaching the highest likelihood.
You will not find a tool in Phylemon to directly convert Nexus formatted trees to newick. However, if you load your Nexus tree in Archaeopteryx you will be able to get its corresponding translation to newick (or even phyloXML) format. Just got to “View as text” and click on “Newick”, a text box with the Newick tree will popup.