-
1) Description of the Interaction Model and Simulation Algorithm
Carlos Adriel Del Carpio Munoz, Atsushi Yoshimori
1999 Volume 10 Pages
3-12
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We describe a novel computer system directed to evaluate protein complex formation in a liquid environment. The relevant feature of the system is a potential function expressing the main thermodynamic and kinetic factors leading to protein interaction in solution. The protein interaction model expresses the interaction energy as basically composed of three forces: electrostatic (hydrogen bond), van der Waals, and hydrophobic. The latter is defined in function of the forces that the solvent molecules exert on the surface of the complex, and the van der Waals forces between the monomers and the solvent.
The interaction model implemented in the system has proven a high discrimination ability between different protein dockings, scoring high those close to the observed crystal structures. These results have led to the establishment of the basic principles underlying protein interaction, which constitutes the main way of expression of the biological function of these macromolecules.
View full abstract
-
Chungfan Kim, Akihko Konagaya, Kiyoshi Asai
1999 Volume 10 Pages
13-22
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
In this paper, we evaluated the complexity and accuracy of dicodon model for gene finding using Hidden Markov Model with Self-Identification Learning. We used five different models as competitors with smaller parametric space than the dicodon model. Our evaluation result shows that the dicodon model outperforms other competitors in terms of sensitivity as well as specificity. This result indicates that the dicodon model can not be represented by a combination of the pair amino-acid, the codon usage, and the G+C content.
View full abstract
-
Tatsuya Akutsu, Kim Lan Sim
1999 Volume 10 Pages
23-29
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
Protein threading, a method employed in protein three-dimensional (3D) structure prediction was only proposed in the early 1990's although predicting protein 3D structure from its given amino acid sequence has been around since 1970's. Here we describe a protein threading method/system that we have developed based on multiple protein structure alignment. In order to compute multiple structure alignments, we developed a similar structure search program on massive parallel computers and a program for constructing a multiple structure alignment from pairwise structure alignments, where the latter is based on the center star method for sequence alignment. A simple dynamic-programming based algorithm which uses a profile matrix obtained from the result of multiple structure alignment was also developed to compute a threading (i. e., an alignment between a target sequence and a known structure). Using this system, we participated in the threading category (category AL) of CASP3 (Third Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction). The results are discussed.
View full abstract
-
Xiaohong Li, Pedro Romero, Meeta Rani, A. Keith Dunker, Zoran Obradovi ...
1999 Volume 10 Pages
30-40
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
Logistic regression (LR), discriminant analysis (DA), and neural networks (NN) were used to predict ordered and disordered regions in proteins. Training data were from a set of non-redundant X-ray crystal structures, with the data being partitioned into N-terminal, C-terminal and internal (I) regions. The DA and LR methods gave almost identical 5-cross validation accuracies that averaged to the following values: 75.9±3.1%(N-regions), 70.7±1.5%(I-regions), and 74.6±4.4%(C-regions). NN predictions gave slightly higher scores: 78.8±1.2%(N-regions), 72.5±1.2%(I-regions), and 75.3±3.3%(C-regions). Predictions improved with length of the disordered regions. Averaged over the three methods, values ranged from 52% to 78% for length=9-14 to≥21, respectively, for I-regions, from 72% to 81% for length=5 to 12-15, respectively, for N-regions, and from 70% to 80% for length=5 to 12-15, respectively, for C-regions. These data support the hypothesis that disorder is encoded by the amino acid sequence.
View full abstract
-
Ethan Garner, Pedro Romero, A.Keith Dunker, Celeste Brown, Zoran Obrad ...
1999 Volume 10 Pages
41-50
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
Disordered regions are sequences within proteins that fail to fold into a fixed tertiary structure and have been shown to be involved in a variety of biological functions. We recently applied neural network predictors of disorder developed from X-ray data to several protein sequences characterized as disordered by NMR (Garner, Cannon, Romero, Obradovic and Dunker, Genome Informatics, 9: 201-213, 1998). A few predictions on the NMR-characterized disordered regions were noted to contain “false” negative indications of order that correlated with regions of function. These and additional examples are examined in more detail here. Overall, 8 of 9 functional segments in 5 disordered proteins were identified or partially identified by this approach. The functions of these regions appear to involve binding to DNA, RNA, and proteins. These regions are known to undergo disorder-to-order transitions upon binding. This apparent ability of the predictors to identify functional regions in disordered proteins could be due to the existence of different flavors, or sub-classes of disorder, originating from the sequence of the disordered regions and perhaps owing to local inclinations toward order. These different flavors may be a characteristic that could be used to identify binding regions within proteins that are difficult to characterize structurally.
View full abstract
-
Xin Chen, Sam Kwong, Ming Li
1999 Volume 10 Pages
51-61
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We present a lossless compression algorithm,
GenCompress, for genetic sequences, based on searching for approximate repeats. Our algorithm achieves the best compression ratios for benchmark DNA sequences. Significantly better compression results show that the approximate repeats are one of the main hidden regularities in DNA sequences.
We then describe a theory of measuring the relatedness between two DNA sequences. Using our algorithm, we present strong experimental support for this theory, and demonstrate its application in comparing genomes and constructing evolutionary trees.
View full abstract
-
Hirotada Kobayashi, Hiroshi Imai
1999 Volume 10 Pages
62-72
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
Constructing restriction maps is one of the important steps towards the determination of DNA sequences. Recently, the single-molecule approaches to constructing restriction maps, such as Optical Mapping by D. Schwartz
et al., have developed. In practice, with the single-molecule approach like Optical Mapping, the identification of the restriction sites is complicated by several error factors due to resolving power of biological experiments. The ordered restriction map alignment problem is a problem to estimate the actual restriction sites from many imprecise copies of map from single molecule.
In this paper, we formulate the problem on the basis of the statistical maximum likelihood estimate, and propose a new efficient local search algorithm for this problem, by applying the Expectation-Maximization (EM) algorithm along with the concept of two-clustering. Our algorithm works well for a lot of sets of simulated data, some of which we believe more difficult than the actual cases.
View full abstract
-
Koichiro Doi, Hiroshi Imai
1999 Volume 10 Pages
73-82
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
The selection of a suitable set of primers is very important for polymerase chain reaction (PCR) experiments. Most existing algorithms for primer selection are concerned with producing a primer pair for each DNA sequence. However, when all the DNA sequences of the target objects are already known, like the approximately 6, 000 yeast ORFs, we may want to design a small set of primers to PCR amplify all the targets, which can then be resolved electrophoretically in a series of experiments. This would be quite useful, because decreasing the number of primers greatly reduces the cost of an experiment.
This paper extends the problem of primer selection for a single experiment presented in Doi and Imai [3] to primer selection for multiple PCR experiments, and proposes algorithms for the extended problem. The algorithms design primer sets one at a time. We extend the greedy algorithm for one PCR experiment in [3] by handling amplified segments in DNA sequences that have been identified by primer pairs already selected and by changing the priorities in the greedy algorithm. This algorithm is applied to real yeast data. The number of primers equaled 85% of the number of identified DNA sequences, which represented more than 90% of all the target DNA sequences. This is 42% the number of primers needed for multiplex PCR. Furthermore, the length of each primer is less than half the length of multiplex PCR primers so the cost of producing the primers is reduced to 20% of the cost in the multiplex PCR case.
View full abstract
-
Nadia El-Mabrouk, David Sankoff
1999 Volume 10 Pages
83-93
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We propose a model of the doubling of a bacterial genome followed by gene order rearrangement to explain present-day patterns of duplicated genes. On the hypothesis that inversion (reversal) is the predominant mechanism of rearrangement, we ask how to reconstruct the ancestral genome at the moment of genome duplication. We present a polynomial algorithm for finding such a genome that minimizes (within 2 reversals) the Hannenhalli-Pevzner formula for reversal distance from the modern genome. We illustrate by applying the algorithm to a set of duplicate genes in the Marchantia polymorpha mitochondrial genome.
View full abstract
-
Mitsuteru Nakao, Hidemasa Bono, Shuichi Kawashima, Tomomi Kamiya, Kazu ...
1999 Volume 10 Pages
94-103
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
The massively parallel hybridization technologies by DNA chips and microarrays make it possible to monitor expression patterns of the whole set of genes in a genome under various conditions. The vast amount of data generated by such technologies necessitates the development of a new database management system that integrates expression data with other molecular biology databases and various analysis tools. We report here an extension of our KEGG (Kyoto Encyclopedia of Genes and Genomes) and DBGET/LinkDB systems for analyzing gene expression data in conjunction with pathway information and genomic information. It is now possible to make use of expression data for the reconstruction of pathways from the complete genome sequences.
View full abstract
-
See-Kiong Ng, Marie Wong
1999 Volume 10 Pages
104-112
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We are entering a new era of research where the latest scientific discoveries are often first reported online and are readily accessible by scientists worldwide. This rapid electronic dissemination of research breakthroughs has greatly accelerated the current pace in genomics and proteomics research. The race to the discovery of a gene or a drug has now become increasingly dependent on how quickly a scientist can scan through voluminous amount of information available online to construct the relevant picture (such as protein-protein interaction pathways) as it takes shape amongst the rapidly expanding pool of globally accessible biological data (e. g. GENBANK) and scientific literature (e. g. MEDLINE).
We describe a prototype system for automatic pathway discovery from on-line text abstracts, combining technologies that (1) retrieve research abstracts from online sources, (2) extract relevant information from the free texts, and (3) present the extracted information graphically and intuitively. Our work demonstrates that this framework allows us to routinely scan online scientific literature for automatic discovery of knowledge, giving modern scientists the necessary competitive edge in managing the information explosion in this electronic age.
View full abstract
-
Masanori Arita, Kiyoshi Asai, Takaaki Nishioka
1999 Volume 10 Pages
113-120
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
A precursor is a compound which is transformed to a class of functional molecules within short steps. It is an important process in the production of natural drugs to decide whether a given compound is a precursor or not. We present two strategies to select precursor compounds in the secondary metabolism of terpenoids: one is to find the packing of basic molecules in the given cyclic structure, and the other is to find the synthetic map of the given set of compounds. Both strategies play important roles in reproducing tracer experiments on a computer.
View full abstract
-
Katsutoshi Takahashi, Masayuki Nakazawa, Yasuo Watanabe, Akihiko Konag ...
1999 Volume 10 Pages
121-132
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We have developed the automated processing algorithms for 2-dimensional (2-D) electrophoretograms of genomic DNA based on RLGS (Restriction Landmark Genomic Scanning) method, which scans the restriction enzyme recognition sites as the landmark and maps them onto a 2-D electrophoresis gel. Our powerful processing algorithms realize the automated spot recognition from RLGS electrophoretograms and the automated comparison of a huge number of such images. In the final stage of the automated processing, a master spot pattern, on which all the spots in the RLGS images are mapped at once, can be obtained. The spot pattern variations which seemed to be specific to the pathogenic DNA molecular changes can be easily detected by simply looking over the master spot pattern. When we applied our algorithms to the analysis of 33 RLGS images derived from human colon tissues, we successfully detected several colon tumor specific spot pattern changes.
View full abstract
-
Masao Nagasaki, Shuichi Onami, Satoru Miyano, Hiroaki Kitano
1999 Volume 10 Pages
133-143
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
The way for expressing biological systems is a key element of usability. Expressions used in the biological society and those in the computer science society have their own merits. But they are too different for one society to utilize the expressions of the other society. In this paper, we design the
bio-calculus that attempts to bridge this gap. We provide syntax which is similar to conventional expressions in biology and at the same time specifies information needed for simulation analysis. The information and mathematical background of
bio-calculus is what is desired for the field of computer science. We show the practicality of bio-calculus by describing and simulating some molecular interactions with
bio-calculus.
View full abstract
-
Tomohiro Yasuda, Hideo Bannai, Shuichi Onami, Satoru Miyano, Hiroaki K ...
1999 Volume 10 Pages
144-154
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
This study aims at automatic construction of a cell lineage from 4D (multi-focal, time-lapse) images, which are taken using a Nomarski DIC (differential-interference contrast) microscope. A system with such abilities would be a powerful tool for studying embryo genesis and gene function based on mutants, whose cell lineage may differ from that of wild types. We have designed and implemented a system for this purpose, and examined its ability through computational experiments.The procedure of our system consists of two parts:(1) Image processing which detect the positions of the nuclei from each 2D microscope image, and (2) Constructing a hypothetical cell lineage based on the information obtained in (1). We have also developed a tool which allows a human expert to easily filter out erroneous nuclei candidates generated in (1). We present computational results and also discuss other ideas which may improve the performance of our system.
View full abstract
-
Akihiro Nakaya, Haretugu Hishigaki, Shinichi Morishita
1999 Volume 10 Pages
155-165
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
The synergetic effects of multiple marker loci regarding quantitative traits such as blood glucose level have attracted interest. In the OLETF model rat of non-insulin dependent diabetes mellitus (NIDDM), our previous study focusing on the effects of multiple genetic factors has found significant marker combinations with respect to oral glucose tolerance (OGT) at 60 minutes after oral administration. Besides the interaction among markers at a particular time point, their correlated behavior in a time series is another interest. Based on the previous results, in this paper, we report the behavior of markers in a time series by using a series of measurements of OGT.
View full abstract
-
Yutaka Ueno, Kiyoshi Asai, Masanori Arita
1999 Volume 10 Pages
166-175
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We have constructed a general framework for integrating application programs with control through a local Web browser. This method is based on a simple inter-process message function from an external process to application programs. Commands to a target program areprepared in a script file, which is parsed by a message dispatcher program. When it is used as ahelper application to a Web browser, these messages will be sent from the browser by clicking a hyper-link in a Web document. Our framework also supports pluggable extension-modules for application programs by means of dynamic linking. A prototype system is implemented on our molecular structure-viewer program, MOSBY. It successfully featured a function to load an extension-module required for the docking study of molecular fragments from a Web page. Our simple framework facilitates the concise configuration of Web softwares without complicated knowledge on network computation and security issues. It is also applicable for a wide range of network computations processing private data using a Web browser.
View full abstract
-
Aaron Stokes, Hideo Matsuda, Akihiro Hashimoto
1999 Volume 10 Pages
176-185
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
Complete DNA sequences (genomes) and associated data are being made available worldwide at an astonishing rate. Through computer analysis of such data, molecular biologists hope to gain an overall understanding of the genome, such as by predicting large-scale gene networks. However, this is difficult because diverse genome data are scattered across many highly heterogeneous databases, and because existing database systems lack the facilities to expose and analyze functional relationships among the data. To address these problems, we propose a new type of genome database system. Since a genome can be thought of intuitively as a kind of ‘document’, our system uses a structured document language based on XML to effectively represent genomes and associated data. The information-rich structures of the genome documents help cope with data diversity and heterogeneity. A powerful query language is introduced that exposes important biological relationships among the genome data. We have obtained favorable results from several experiments, demonstrating the usefulness of our method in building a top-down view of genome functionality.
View full abstract
-
Tomohiro Moriyama, Ayumi Shinohara, Masayuki Takeda, Osamu Maruyama, T ...
1999 Volume 10 Pages
186-195
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
We are developing a system which finds a genetic network from data obtained by multiple gene disruptions and overexpressions. We deal with a genetic network as a weighted graph, where each weight represents the strength of activation from a gene to another gene. In this paper, we explain the overview of our system, and our strategy to visualize the weighted network. Wealso study the computational complexity related to the visualization.
View full abstract
-
David Haussler
1999 Volume 10 Pages
199
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yusuke Nakamura
1999 Volume 10 Pages
200-201
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yutaka Akiyama, Tamotsu Noguchi, Kentaro Onizuka, Makoto Ando
1999 Volume 10 Pages
202-203
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Susumu Goto, Kotaro Shiraishi, Kayo Okamoto, Hiroko Ishida, Toshi Naka ...
1999 Volume 10 Pages
204-205
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yasuhiko Kitamura, Tetsuro Nanbu, Shoji Tatsumi
1999 Volume 10 Pages
206-207
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Ken Kurokawa, Yoshitaka Hasegawa, Kenichi Kawashima, Toshiyuki Okumura ...
1999 Volume 10 Pages
208-209
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Shinsei Minoshima, Susumu Mitsuyama, Saho Ohno, Takashi Kawamura, Nobu ...
1999 Volume 10 Pages
210-212
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Tadasu Shin-i, Yuji Kohara
1999 Volume 10 Pages
213-214
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yoshinori Tamada, Satoshi Matsumoto, Osamu Maruyama, Satoru Miyano
1999 Volume 10 Pages
215-216
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Atsushi Doi, Rainer Drath, Masao Nagasaki, Hiroshi Matsuno, Satoru Miy ...
1999 Volume 10 Pages
217-218
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Miyako Tanaka, Hiroshi Matsuno, Sanae Nakazono, Satoru Miyano
1999 Volume 10 Pages
219-220
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Koji Kadota, Rika Miki, Yasushi Okazaki, Kentaro Shimizu, Yoshihide Ha ...
1999 Volume 10 Pages
221-222
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Kyotetsu Kanaya, Akira Ohyama, Tatsuya Akutsu, Asao Fujiyama
1999 Volume 10 Pages
223-224
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Ken-ichi Kurata, Akira Suyama
1999 Volume 10 Pages
225-226
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Tatsuhiko Tsunoda, Yoshiaki Hojo, Chikashi Kihara, Norihiko Shiraishi, ...
1999 Volume 10 Pages
227-228
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Jianghong An, Takao Nakama, Yasushi Kubota, Hiroshi Wako, Akinori Sara ...
1999 Volume 10 Pages
229-230
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Carlos Adriel Del Carpio Munoz
1999 Volume 10 Pages
231-232
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Hiroaki Kato, Yoshimasa Takahashi, Hidetsugu Abe
1999 Volume 10 Pages
233-234
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Makiko Suwa, Henrik T. Yudate, Yasuhiko Masuho, Asaf A. Salamov, Chris ...
1999 Volume 10 Pages
235-236
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Shouji Tatsumoto, Kenji Satou
1999 Volume 10 Pages
237-238
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Junichi Uechi, Takatugu Hirokawa, Makiko Suwa, Shigeki Mitaku
1999 Volume 10 Pages
239-240
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yutaka Ueno, Katsutoshi Takahashi, Kiyoshi Asai, Chikara Sato
1999 Volume 10 Pages
241-242
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Takaaki Nishioka, Kimito Funatsu
1999 Volume 10 Pages
243-244
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Atsushi Ogiwara
1999 Volume 10 Pages
245-246
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Akinori Sarai, Hidetoshi Kono, Michael Gromiha, Fabio Pichierri, Kenji ...
1999 Volume 10 Pages
247-248
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Goro Terai, Takeshi Mizuno, Toshihisa Takagi
1999 Volume 10 Pages
249-250
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yifei Wang, Akinori Sarai
1999 Volume 10 Pages
251-252
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Yukihiro Maki, Shoji Watanabe, Yukihiro Eguchi, Daisuke Tominaga, Masa ...
1999 Volume 10 Pages
253-254
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Osamu Gotoh
1999 Volume 10 Pages
255-256
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Haretsugu Hishigaki, Toshihide Ono, Akira Tanigami, Toshihisa Takagi
1999 Volume 10 Pages
257-258
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS
-
Hideki Noguchi, Taizo Hanai, Hiroyuki Honda, Takeshi Kobayashi
1999 Volume 10 Pages
259-260
Published: 1999
Released on J-STAGE: July 11, 2011
JOURNAL
FREE ACCESS