2008, Volume 128, Issue 11, pp. 1547-1555
Vast amounts of DNA sequence data, protein three-dimensional (3D) structure data, and RNA expression data have been produced by genome sequencing, structural genomics, and omics projects, and we are now at a stage where comprehensive views of cell activity and the molecular mechanisms of life can be deduced. In reality, however, we are inundated with data and are still finding ways to utilize them fully. In this report, I present our observations on the growth of protein 3D structure data and our efforts to deduce protein functions from 3D structures. We found that the 3D structures of a high proportion of the proteins derived from genome sequences can now be predicted, and that methods to predict functions from 3D structures are in high demand. The methods we have developed predict some functions, namely RNA and ligand interfaces, from those 3D structures and DNA sequences with relatively high accuracy. The predictions are accurate enough to help deduce the atomic structures of the complexes.