Abstract
Phylogenetics is the study of the evolutionary relationships among species. It is derived from the ancient greek words, phylon meaning "race", and genetikos, meaning "relative to birth". An important methodology in phylogenetics is a cladistics methodology (parsimony) applied to the study of taxonomic classification. Modern study includes as source data aspects of molecular biology, such as the DNA sequence of homologous (orthologous) genes. The algorithms used attempt to reconstruct evolutionary relationships in the form of phylogenetic trees, based on the available morphological data, behavioral data, and usually DNA sequence data (Fitch W. M., 1971). The topic of this thesis is the parallel implementation of an existing algorithm called Maximum Parsimony, a search for a guaranteed optimal tree(s) based the fewest number of mutations required for tree construction. The algorithm grows linearly with the increase in DNA sequence length and combinatorially with the number of organisms studied (Felsenstein J. , The number of evolutionary trees., 1978). The algorithm may take hours to complete. The limitations of the current implementations such as PAUP are that they are limited to just one core on the CPU, even if 8 are available. This parallel implementation may use as many cores as are available. The method of research is to replicate the accuracy of existing serial software, parallelize the algorithm to many cores without losing accuracy, optimize by various methods, then attempt to port to other hardware architectures. Some time is spent on the implementation of the algorithms onto GPUs and Clusters. The results are that, while this implementation matches the accuracy of the current standard, and speeds up in parallel, it does not presently match the speed of PAUP for reasons yet to be determined.
Library of Congress Subject Headings
Cladistic analysis--Data processing; Parallel algorithms
Publication Date
11-1-2011
Document Type
Thesis
Advisor
Skuse, Gary
Advisor/Committee Member
Khanna, Gurcharan
Advisor/Committee Member
Kaminsky, Alan
Recommended Citation
Darling, Andrew, "Parallel implementation of maximum parsimony search algorithm on multicore CPUs" (2011). Thesis. Rochester Institute of Technology. Accessed from
https://repository.rit.edu/theses/5445
Campus
RIT – Main Campus
Comments
Note: imported from RIT’s Digital Media Library running on DSpace to RIT Scholar Works in December 2013.