- Research cladistics and phylogeny at Bozeman Science. Review common misconceptions here.
- Select a virus to research.
NCBI is a free data repository (one of many) where people upload their sequenced data. They also have to include information called metadata which includes things like the location, date, species, sequencing method etc. associated with the sequence. The Virus Variation Resource is a tool to easily search metadata and access virus sequences of interest.
- Consider what types of samples you might be interested in, and what characteristics would allow you to highlight relationships between virus strains.
If you are interested in a higher level of challenge, you may consider sequences of proteins extracted from yeast or bacterial pathogens.
- Download sequence files of interest.
- Open SeaView and load your downloaded sequences.
- Make an alignement. An alignment does the best job at figuring out how the protein likely evolved and arranges the characters to reflect this.
- Once you have aligned your sequences you will build a tree from them! In the trees menu click PhyML, then click run on the menu that pops up.
- Once you have your tree click save as unrooted tree
- Trees are difficult to interpret so when presenting them its often advisable to use some color to make your data easier for the audience to digest.
- Load unrooted tree in Fig Tree and apply color based on information from metadata to highlight patterns.
- Submit your colorized tree, along with a one page executive summary identifying your pathogen of interest, describing the source of your data, and explaining molecular relationships you can infer from your tree.