High Performance Bioinformatics

Abstract

Bioinformatics is the science of managing, mining, and interpreting information from biological sequences and structures. Genome sequencing projects have contributed to an exponential growth in complete and partial sequence databases. Similarly, the rapidly expanding structural genomics initiative aims to catalog the structure-function information for proteins. Advances in technology such as microarrays have launched the subfield of genomics and proteomics to study the genes, proteins, and the regulatory gene expression circuitry inside the cell. What characterizes the state of the field is the flood of data that exists today or that is anticipated in the future. Combined with the fact that many of the bioinformatics tasks are highly compute intensive (e.g., ad initio protein folding), it is clear that high performance computing has a fundamental role to play in various bioinformatics problems ranging from the protein folding problem to large-scale genomics to inferring pathways and regulatory networks. The goal of this session is to present the latest research in high-performance computing applied to bioinformatics tasks. We are especially interested in scalable, parallel, and distributed algorithms for mining and analyzing bioinformatics data, as well as system tools that support large-scale high performance bioinformatics.

Publication
Euro-Par 2004 Parallel Processing, 10th International Euro-Par Conference, Pisa, Italy, August 31-September 3, 2004