The Challenge of Predicting Gene Function
The biological sciences are undergoing an explosion in the amount of available data, and new data analysis methods are needed to deal with it. A central problem in bioinformatics is the assignment of function to sequenced open reading frames (ORFs). The most common approach is based on inferred homology, using a statistically based sequence similarity (SIM) method such as PSI-BLAST.