Yonghui Chen
Kevin D. Reilly
Alan Sprague
Zhijie Guan

SEQOPTICS: A Protein Sequence Clustering Method.

BMC Bioinformatics 2006, 7(Suppl 4):S10.
First International Multi-Symposiums on Computer and Computational Sciences, Volume 1 (IMSCCS'06), 69-75.


Abstract:

Protein sequence clustering has been widely used as part of the analysis of protein structure and function. In most cases, single-link or graph-based clustering algorithms have been applied. In this paper, we demonstrate an approach to clustering proteins, SEQOPTICS (SEQuence clustering with OPTICS), which is based on OPTICS (Ordering Points To Identify the Clustering Structure), an attractive approach due to its emphasis on visualization of results and support for interactive work, e.g., in choosing parameters. OPTICS has not been used, as far as we know, for protein sequence clustering. We have implemented a system with OPTICS at its core to perform protein sequence clustering. In this paper, we test SEQOPTICS with four data sets from different data sources. Visualization of the sequence clustering structure is demonstrated. Our system was evaluated by comparison with other existing methods. Analysis of the results demonstrates that our system performs better by the Jaccard coefficient evaluation criterion.
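
The abstract names the Jaccard coefficient as the evaluation criterion but does not spell out how it is computed. The sketch below assumes the pair-counting form commonly used to compare a clustering against a reference classification: the number of item pairs grouped together in both, divided by the number of pairs grouped together in at least one. The function name `jaccard_coefficient`, the dictionary input format, and the protein identifiers are hypothetical, chosen for illustration only, and may differ from what the authors used.

```python
from itertools import combinations


def jaccard_coefficient(predicted, reference):
    """Pair-counting Jaccard coefficient between two clusterings.

    Both arguments map an item id (e.g. a protein accession) to a
    cluster label. Only items present in both mappings are compared.
    """
    items = sorted(set(predicted) & set(reference))
    both = only_pred = only_ref = 0
    for a, b in combinations(items, 2):
        same_pred = predicted[a] == predicted[b]
        same_ref = reference[a] == reference[b]
        if same_pred and same_ref:
            both += 1        # pair co-clustered in both partitions
        elif same_pred:
            only_pred += 1   # co-clustered only in the prediction
        elif same_ref:
            only_ref += 1    # co-clustered only in the reference
    denom = both + only_pred + only_ref
    # Assumption: if no pair is co-clustered anywhere, treat the
    # partitions as identical (all singletons) and return 1.0.
    return both / denom if denom else 1.0


# Hypothetical toy example: four proteins, two clusterings.
predicted = {"p1": 0, "p2": 0, "p3": 1, "p4": 1}
reference = {"p1": "A", "p2": "A", "p3": "A", "p4": "B"}
score = jaccard_coefficient(predicted, reference)
```

A value of 1.0 means the two partitions agree on every co-clustered pair; values near 0 mean they share almost none. Pairs that are separated in both partitions do not enter the score, so it is not inflated by the many pairs that trivially belong to different families.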