Yonghui Chen
Kevin D. Reilly
Alan Sprague
Zhijie Guan
SEQOPTICS: A Protein Sequencing Clustering Method.
BMC Bioinformatics 2006 7 (Suppl 4):S10, 69-75,
First International Multi-Symposiums on
Computer and Computational Sciences
Volume 1 (IMSCCS'06),
69-75.
Abstract:
Protein sequence clustering has been widely used as part
of the analysis of protein structure and function. In
most cases single link or graph-based clustering algorithms
have been applied. In this paper, we demonstrate an
approach of clustering proteins, SEQOPTICS (sequence
clustering with OPTICS), which is based on OPTIC
(Ordering Points To Identify the Clustering Structure),
an attractive approach due to its emphasis on
visualization of results and support for interactive work,
e.g., in choosing parameters. OPTICS has not been
used, as far we know, for proteing sequence structuring.
We have implemented a system with OPTICS at its
core to perform protein sequence clustering. In this
paper, we test SEQOPTICS with four data sets from
different data sources. Visualization of the sequence
clustering structure is demonstrated. Our system was
evaluated by comparison with other existing methods.
Analysis of the results demonstrates that our system
performs better by the Jaccard coefficient evaluation
criterion.