Journal of Computer Chemistry, Japan
Online ISSN : 1347-3824
Print ISSN : 1347-1767
ISSN-L : 1347-1767
Letters (Selected Paper)
Performance Research of Clustering Methods for Detecting State Transition Trajectories in Hemoglobin
Kei TAKAMIYukichi KITAMURAMasataka NAGAOKA
Author information
JOURNAL FREE ACCESS FULL-TEXT HTML

2020 Volume 19 Issue 4 Pages 154-157

Details
Abstract

The time-series clustering method is one of unsupervised machine learning techniques that classify time-series data. In this article, we applied three methods to the clustering analysis for 200 molecular dynamics (MD) trajectories of human adult hemoglobin (HbA), and have reported their clustering performances for detecting the T-R state transition trajectories (TrajT-R). By compared with their silhouette indices, we have discussed the proper clustering conditions.

Figures
Figure 1.

 ED and DTW. (Top) Two time series x and y. (Bottom) The distances between x and y in ED and those in DTW. The terminal points are each linked by a line segment.

Figure 2.

 The silhouette index s ¯ k depending on the number of clusters partitioning k (k=2 to 5) in three time-series clustering methods, ED, DBA and soft-DTW.

Table
Table 1. The silhouette indices and the numbers of TrajT-R in three methods, ED, DBA and soft-DTW.
Methods s ¯ ka # of TrajT-Rb
This work:
 ED 0.38 34 (17%)
 DBA 0.37 54 (27%)
 soft-DTW 0.78 24 (12%)
Previous studies:
 Empirical criterion - 8%[2], 16%[3]

a s ¯ kis an average of s ¯ k(iα) over all the number of TrajT-R.

b The percentage in parentheses are the ratio of TrajT-R in all the 200 MD trajectories.

References
 
© 2020 Society of Computer Chemistry, Japan
feedback
Top