-
Xiaoping Gao, Kunqing Xie, Xingxing Jin, Xiaofeng Lei
2007 Volume 6 Pages S611-S619
Published: 2007
Released on J-STAGE: October 05, 2007
JOURNAL
FREE ACCESS
Teleconnection is a linkage between two climate events that occur in widely separated regions of the globe on a monthly or longer timescale. In the past, statistical methods have been used to discover teleconnections. However, because of the overwhelming volume and high resolution of datasets acquired by modern data acquisition systems, these methods are not sufficient. In this paper, we propose a novel approach to finding teleconnections in global climate datasets using data mining technologies. We present experiments on real datasets and find some interesting teleconnections, including well-known ones such as ENSO. The experiments indicate that our method is usable and efficient.
-
Dan Yang
2007 Volume 6 Pages S620-S635
Published: 2007
Released on J-STAGE: October 05, 2007
JOURNAL
FREE ACCESS
Every country's economic development affects all levels of its society and thus the results of its social science research. To make social science research better serve economic development, many countries have established social science research institutes, among them management research institutes, which are closely related to economic research institutes. Through a comparative study of the locations and founding dates of these institutes in different countries, this article analyses development trends and the relationship between economics and management research, providing relevant experience and background for planning purposes.
-
Feng Jiao, Guiming He
2007 Volume 6 Pages S636-S640
Published: 2007
Released on J-STAGE: October 05, 2007
JOURNAL
FREE ACCESS
This paper describes a real-time online prototype automobile and truck driver-fatigue monitor. It uses remotely located charge-coupled-device cameras equipped with active infrared illuminators to acquire video images of the driver. Various visual cues that typically characterize the level of alertness of a person are extracted in real time and systematically combined to infer the fatigue level of the driver. The visual cues employed characterize eyelid movement, gaze movement, head movement, and facial expression. A probabilistic model is developed to model human fatigue and to predict fatigue based on the visual cues obtained. The simultaneous use of multiple visual cues and their systematic combination yields a much more robust and accurate fatigue characterization than using a single visual cue. This system was validated under real-life fatigue conditions with human subjects of different ethnic backgrounds, genders, and ages; with/without glasses; and under different illumination conditions. It was found to be reasonably robust, reliable, and accurate in fatigue characterization.
-
Tan Sun, Guobin Huang
2007 Volume 6 Pages S641-S651
Published: 2007
Released on J-STAGE: October 12, 2007
JOURNAL
FREE ACCESS
The authors analyze the developmental framework of digital libraries in China and point out their current demand characteristics, development requirements, and developmental period. They then conclude that it is necessary to start up a new paradigm evolution of a digital library, from a traditional digital library to a virtual digital library. On that basis, they describe in detail several problems and developmental approaches that developing a virtual digital library must deal with, drawing lessons from the prototype DILIGENT.
-
Min Qi, Bao-lin Zhang, Guang-he Liang, Jie Wang, Xin-ping Cai
2007 Volume 6 Pages S652-S657
Published: 2007
Released on J-STAGE: October 12, 2007
JOURNAL
FREE ACCESS
3D modeling and visualization of geological volumes is very important for accurately interpreting and locating subsurface geological bodies in mining exploration and deep prospecting. However, it suffers from a lack of information because the target area is usually unexplored and lacks geological data. This paper presents our experience in building a 3D model of geological volumes based on geophysical data. We have developed a 3D visualization system based on an object-oriented (OO) approach and modular programming, using the C++ language and the Microsoft .NET platform. The system first builds a database of high resistivity method and MT data, and then uses irregular tetrahedrons to construct the 3D geological model.
-
Xingsen Li, Yong Shi, Jun Li, Peng Zhang
2007 Volume 6 Pages S658-S666
Published: 2007
Released on J-STAGE: October 12, 2007
JOURNAL
FREE ACCESS
Data are important for making decisions, and the quality of the data affects the quality of the decisions. Data mining, as one of the most important sources of knowledge, needs high-quality data, but many enterprises do not have enough good-quality data. By systematically analyzing the reasons for low data quality, a new method called data mining consulting has been established for improving data quality. It defines data quality in a wider sense from the viewpoint of data mining, finds data quality problems, and solves them with a series of methods. Its application shows that it is practical and can increase data quality considerably.
-
Jian Wang, Rui Chen
2007 Volume 6 Pages S667-S675
Published: 2007
Released on J-STAGE: October 12, 2007
JOURNAL
FREE ACCESS
This paper adopts a panel data model, which combines time series and cross-sectional data, to study data from seventeen institutes of the Chinese Academy of Sciences. It empirically analyzes Intellectual Property Rights (IPR) authorization and the relationship between R&D input and R&D scientific and technological output. The results show that IPR authorization plays a notable role in scientific and technological growth and is positively correlated with it.
-
V. V. Ezhela
2007 Volume 6 Pages S676-S689
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
This paper presents a critical overview of the current doubtful practice of presenting correlated data in the physics literature and in scientific and technological databases. Simple rules are formulated for calculating the rounding thresholds that preserve the positive definiteness of covariance and correlation matrices, as well as the rounding thresholds for the components of the mean vector that keep them inside the "non-rounded" scatter region. Evidence is presented that in the multivariate case there are severe limitations on the applicability of the linear differential law of uncertainty propagation. An explicit relation is obtained between the number of input random variables I, the number of output variables D, and the order T of Taylor polynomials sufficient to preserve a self-consistent numerical presentation of the mean value of the vector function and its covariance matrix under the nonlinear differential propagation procedure. It is stressed that the rounding thresholds for the safe rounding of correlated data impose severe requirements on the storage and exchange formats of correlated data, requirements that cannot be met in traditional publications on paper but can be met in electronic media.
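To make the rounding issue concrete, here is a minimal sketch, not taken from the paper, of how rounding can destroy the positive definiteness of a correlation matrix. The matrix values and the helper names `is_positive_definite` and `round_matrix` are hypothetical, and the check uses a plain Cholesky factorization rather than the paper's threshold rules.

```python
import math

def is_positive_definite(matrix):
    """Return True if a symmetric matrix admits a Cholesky factorization."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                pivot = matrix[i][i] - partial
                if pivot <= 0.0:
                    return False  # a non-positive pivot means not positive definite
                lower[i][j] = math.sqrt(pivot)
            else:
                lower[i][j] = (matrix[i][j] - partial) / lower[j][j]
    return True

def round_matrix(matrix, digits):
    """Round every entry to the given number of decimal places."""
    return [[round(x, digits) for x in row] for row in matrix]

# Hypothetical correlation matrix of two strongly correlated quantities.
corr = [[1.0, 0.996], [0.996, 1.0]]
rounded = round_matrix(corr, 2)  # the off-diagonal entry becomes 1.0

print(is_positive_definite(corr))
print(is_positive_definite(rounded))
```

With two decimal places the off-diagonal element rounds to 1.0, so the rounded matrix is singular and no longer a valid correlation matrix for two distinct variables.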
-
Ke Li, Fan Lin, Kunqing Xie
2007 Volume 6 Pages S690-S697
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
Because of the development of modern satellites and other data acquisition systems, global climate research often involves high dimensional datasets of overwhelming volume and complexity. As a data preprocessing and analysis method, clustering is playing an increasingly important role in this research. In this paper, we propose a spatial clustering algorithm that, to some extent, mitigates the dimensionality problem in high dimensional clustering. The similarity measure of our algorithm is based on the number of top-k nearest neighbors that two grids share. The neighbors of each grid are computed from the time series associated with that grid, and computing the nearest neighbors of an object is the most time-consuming step. Following Tobler's "First Law of Geography," we add a spatial window constraint on each grid to restrict the number of grids considered, which greatly improves the efficiency of our algorithm. We apply this algorithm to a 100-year global climate dataset and partition the global surface into sub-areas under various spatial granularities. Experiments indicate that our spatial clustering algorithm works well.
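The shared-neighbor similarity with a spatial window can be sketched as follows. This is only an illustration under stated assumptions: the Euclidean distance between time series, the square window measured in grid cells, the toy data, and the function names are all hypothetical choices, since the abstract does not specify them.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equally long time series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def top_k_neighbors(cell, series, k, window):
    """k nearest cells by time-series distance, searched only inside a spatial window."""
    row, col = cell
    candidates = [
        other for other in series
        if other != cell
        and abs(other[0] - row) <= window
        and abs(other[1] - col) <= window
    ]
    candidates.sort(key=lambda other: euclidean(series[cell], series[other]))
    return set(candidates[:k])

def snn_similarity(a, b, series, k, window):
    """Number of top-k nearest neighbors that cells a and b share."""
    return len(top_k_neighbors(a, series, k, window) & top_k_neighbors(b, series, k, window))

# Toy grid: one row of eight cells, two groups with clearly different series.
series = {
    (0, 0): [1.0, 1.0], (0, 1): [1.1, 1.0], (0, 2): [1.0, 1.1], (0, 3): [1.1, 1.1],
    (0, 4): [5.0, 5.0], (0, 5): [5.1, 5.0], (0, 6): [5.0, 5.1], (0, 7): [5.1, 5.1],
}

same_group = snn_similarity((0, 0), (0, 1), series, k=3, window=7)
different_group = snn_similarity((0, 0), (0, 4), series, k=3, window=7)
narrow_window = top_k_neighbors((0, 0), series, k=3, window=1)
print(same_group, different_group, narrow_window)
```

Shrinking `window` reduces the number of candidate cells whose distances must be computed, which is where the efficiency gain described in the abstract would come from.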
-
Xudong Wang
2007 Volume 6 Pages S698-S714
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
Digital World History is a new expression of world history (or maybe "a new method for world history expression") and a paradigm of world history description, study, and application by virtual informatization and recovery. It is also a comprehensive systematic study through dynamic marks, integrated description, and retrieval of human society evolution and its causality dependent on the theory and methodology of digitization information. It aims at breaking the limitation of diachronic language attributed to the process of history cognition, summation, and recovery, addressing a possible scheme to fuse historical factors in relation to changing history, dynamically applying a multiplicity of results so that the discipline of world history can meet the needs of the information-equipped society of the 21st century. In this article, the author uses theoretical modelling methods, resulting in a blueprint of the quality issue, namely the Digital World History premise, and a paradigm for setting the foundation and scientific data strategy as a basis for its necessity.
-
Qianzi Tian, Di Jiang
2007 Volume 6 Pages S715-S722
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
In this paper, we introduce in brief the basic conditions of the Sino-Tibetan data resources, the STEDT project (the Sino-Tibetan Etymological Dictionary and Thesaurus) at the University of California, Berkeley and the STDP (The Sino-Tibetan Database and Retrieval System Project) at the Chinese Academy of Social Sciences (CASS), including the data structures, data volumes, and retrieval methods. We also discuss interdisciplinary information on the origin of East Asian civilization, which consists of several disciplines, including linguistics, molecular biology, human genetics, and archaeology.
-
Bangzhong Wang, Yinglong Xu, Baogui Bi
2007 Volume 6 Pages S723-S737
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
With the development of the global economy, the impact of tropical cyclones has become far-reaching. Thus they are a fundamental issue to be addressed both nationally and globally. The socio-economic impact is particularly noticeable in developing countries, especially China. This paper begins with the effects of cyclones on regional and global economies. Then a brief introduction to the past and current situation of, and progress in, cyclone forecasting and warning in China is presented. Finally, the paper gives recommendations for improving the tropical cyclone forecasting and warning systems.
-
Cuo Cai, Kunqing Xie
2007 Volume 6 Pages S738-S742
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
Currently there are many methods of collecting geoscience data, such as station observations, satellite images, sensor networks, etc. All of these data sources from different regions and time intervals are combined in geoscience research activities today. Using a mixture of several different data sources may have benefits but may also lead to severe data quality problems, such as inconsistent data and missing values. There have been efforts to produce more consistent data sets from multiple data sources. However, because of the huge gaps in data quality among the different sources, data quality inequality among different regions and time intervals has still occurred in the resultant data sets. As the construction methods of these data sets are quite complicated, it would be difficult for users to know the data quality of a dataset not to mention the data quality for a specified location or a given time interval. In this paper, the authors address the problem by generating a data quality measure for all regions and time intervals of a dataset. The data quality measure is computed by comparing the constructed datasets and their sources or other relevant data, using data mining techniques. This paper also demonstrates how to handle major quality problems, such as outliers and missing values, by using data mining techniques in the geoscience data, especially in global climate data.
-
QingHong Yan, Ning Zhang, Jie Wu, Tao Zhang
2007 Volume 6 Pages S743-S748
Published: 2007
Released on J-STAGE: October 23, 2007
JOURNAL
FREE ACCESS
Event-related potential (ERP) is the measurement of the brain's electrical activity in response to different types of events, such as attention, words, thinking, or sounds. By measuring the brain's response to such events, we can learn how different types of information are processed. As the mass of recorded ERP data explodes, an automatic and accurate tool to store, manage, and retrieve data readily is of increasing concern in neuroinformatics. In this paper, we describe a relational ERP database that has been constructed using the SQL Server 2000 database management system and an IIS web server that has been set up for data retrieval through a custom web interface (http://202.113.232.103:8088/erpdb/index.asp). A novel database structure has been used to store ERP data of different activity channels, which provides a rapid and accurate way to retrieve data within any given time range with various search options. The database is divided into: (1) subjects' information and record information and (2) ERP data, which has been structured and standardized in a database table supplemented with unrestricted text files. It can integrate or exchange data with other clinical databases or computer-based information systems through a program based on ADO techniques. Users are able to readily retrieve ERP data through the user-friendly web page interface. All online resources of the database are freely available to the scientific community. As the database develops further, we anticipate it will become a valuable tool that will make a great contribution to everyday clinical practice, teaching, and research work in neuroscience and psychology in the future.
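The idea of retrieving ERP samples for a channel within a given time range can be sketched with a small relational example. This is a sketch under assumptions: it uses Python's built-in sqlite3 as a stand-in for SQL Server 2000, and the table name `erp_sample`, its columns, and the sample values are hypothetical rather than the schema described in the paper.

```python
import sqlite3

# In-memory database standing in for the server-side ERP database.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE erp_sample ("
    " record_id INTEGER, channel TEXT, time_ms INTEGER, amplitude_uv REAL)"
)
# A composite index keeps channel and time-range lookups fast as the table grows.
conn.execute(
    "CREATE INDEX idx_record_channel_time ON erp_sample (record_id, channel, time_ms)"
)

# Hypothetical samples: (record, channel, time in ms, amplitude in microvolts).
samples = [
    (1, "Cz", 0, 0.5), (1, "Cz", 100, 1.8), (1, "Cz", 200, 3.2),
    (1, "Cz", 300, 6.1), (1, "Cz", 400, 2.4), (1, "Pz", 300, 7.0),
]
conn.executemany("INSERT INTO erp_sample VALUES (?, ?, ?, ?)", samples)

def fetch_window(connection, record_id, channel, start_ms, end_ms):
    """Return (time_ms, amplitude_uv) rows for one channel within a time range."""
    cursor = connection.execute(
        "SELECT time_ms, amplitude_uv FROM erp_sample"
        " WHERE record_id = ? AND channel = ? AND time_ms BETWEEN ? AND ?"
        " ORDER BY time_ms",
        (record_id, channel, start_ms, end_ms),
    )
    return cursor.fetchall()

window = fetch_window(conn, 1, "Cz", 200, 400)
print(window)
```

Storing one row per sample, rather than one opaque blob per recording, is what allows a time-range query like this to be answered directly by the database.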
-
Fan Lin, XingXing Jin, Cheng Hu, XiaoPing Gao, Kunqing Xie, XiaoFeng L ...
2007 Volume 6 Pages S749-S755
Published: 2007
Released on J-STAGE: October 26, 2007
JOURNAL
FREE ACCESS
In this paper, we apply data mining technologies to a 100-year global land precipitation dataset and a 100-year Sea Surface Temperature (SST) dataset. Some interesting teleconnections are discovered, including well-known patterns and unknown patterns (to the best of our knowledge), such as teleconnections between the abnormally low temperature events of the North Atlantic and floods in Northern Bolivia, abnormally low temperatures of the Venezuelan Coast and floods in Northern Algeria and Tunisia, etc. In particular, we use a high dimensional clustering method and a method that mines episode association rules in event sequences. The former is used to cluster the original time series datasets into higher spatial granularity, and the latter is used to discover teleconnection patterns among event sequences that are generated by the clustering method. To verify our method, we also run experiments on the SOI index and a 100-year global land precipitation dataset and find many well-known teleconnections, such as teleconnections between SOI lower events and drought events of Eastern Australia, South Africa, and North Brazil; SOI lower events and flood events of the middle-lower reaches of the Yangtze River; etc. We also run explorative experiments to help domain scientists discover new knowledge.
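A rule of the form "event A is followed by event B within a few time steps" can be scored with a simple confidence measure. The sketch below is one possible reading of an episode association rule, not the authors' algorithm; the function name `rule_confidence`, the lag window, and the monthly event indices are hypothetical.

```python
def rule_confidence(antecedent_times, consequent_times, max_lag):
    """Fraction of antecedent events followed by a consequent event within max_lag steps."""
    if not antecedent_times:
        return 0.0
    hits = 0
    for t in antecedent_times:
        # A hit requires a consequent strictly after t and no later than t + max_lag.
        if any(t < c <= t + max_lag for c in consequent_times):
            hits += 1
    return hits / len(antecedent_times)

# Hypothetical month indices of events in two regions.
cold_sst_events = [3, 15, 27, 40]
flood_events = [5, 17, 33, 41]

confidence = rule_confidence(cold_sst_events, flood_events, max_lag=3)
print(confidence)
```

In this toy data three of the four cold events are followed by a flood within three months, so a rule miner would report the pattern only if its confidence threshold is 0.75 or lower.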
-
Chen Cao
2007 Volume 6 Pages S756-S759
Published: 2007
Released on J-STAGE: October 26, 2007
JOURNAL
FREE ACCESS
This paper presents results of applying a machine learning technique, the Support Vector Machine (SVM), to the astronomical problem of matching the Infra-Red Astronomical Satellite (IRAS) and Sloan Digital Sky Survey (SDSS) object catalogues. In this study, the IRAS catalogue has much larger positional uncertainties than those of the SDSS. A model was constructed by applying the supervised learning algorithm (SVM) to a set of training data. Validation of the model shows a good identification performance (∼ 90% correct), better than that derived from classical cross-matching algorithms, such as the likelihood-ratio method used in previous studies.
-
Xing Chen
2007 Volume 6 Pages S760-S769
Published: 2007
Released on J-STAGE: October 26, 2007
JOURNAL
FREE ACCESS
This article first discusses the characteristics of scientific research in the humanities and social sciences (HSS), and then lays out its basic e-management demands. In building a scientific research e-management platform (SREMP), attention must be paid to the characteristics of its data and structure. The data in the SREMP of HSS domains have multiformity. That is, their structure should be an integrative multi-functioned information subsystem with a clearly graded data management mode, flexible user jurisdiction management functions, accurate and common retrieval systems, reliable security design, and flexibility to improve its operations, and ease in maintenance and amelioration. The article also describes the development tendencies of a SREMP.
-
Dafang Zhuang, Wen Yuan, Jiyuan Liu, Dongsheng Qiu, Tao Ming
2007 Volume 6 Pages S770-S778
Published: 2007
Released on J-STAGE: October 26, 2007
JOURNAL
FREE ACCESS
The data sharing system for resource and environment science databases of the Chinese Academy of Sciences (CAS) has an open three-tiered architecture, which integrates the geographical databases of about 9 CAS institutes through distributed unstructured data management, metadata integration, catalogue services, and security control. The data tier consists of several distributed data servers that are located in each CAS institute and support such unstructured data formats as vector files, remote sensing images or other raster files, documents, multi-media files, tables, and other format files. For spatial data files, a format transformation service is provided. The middle tier involves a centralized metadata server, which stores metadata records of data on all data servers. The primary function of this tier is the catalog service, supporting the creation, search, browsing, updating, and deletion of catalogs. The client tier involves an integrated client that provides end users with interfaces to search, browse, and download data or to create a catalog and upload data.
-
Ganghua Lin
2007 Volume 6 Pages S779-S781
Published: 2007
Released on J-STAGE: October 26, 2007
JOURNAL
FREE ACCESS
Solar physics and heliospheric study have arrived at the era of the Virtual Solar Observatory (VSO). The VSO gathers data from observation stations and data centers geographically distributed throughout the world, providing data from multiple spectral bands. In this paper, the author briefly analyzes the existing data model and presents a well-defined description of our local data (produced at the Huairou Solar Observation Station). The work lays the foundation for querying our data in the VSO. It forms the beginning for adding additional search elements and categories to the VSO, thereby providing an infrastructure for a web site of the solar physics domain in China.
-
Ruihui Di, Honglei Qin, Xiaobai Li
2007 Volume 6 Pages S782-S788
Published: 2007
Released on J-STAGE: November 06, 2007
JOURNAL
FREE ACCESS
In this paper, a new model for suppressing jammers to GPS receivers is proposed. In the model, circular antenna arrays combining minimum norm (min-norm) and linearly constrained minimum variance (LCMV) algorithms are used for signal anti-jamming. The original incident directions of six GPS signals and two jammers were assumed. The simulation was performed while varying the power of the two jammers and the number of elements of the circular antenna array. The simulation results indicate that, with this new signal suppression model, the null depths assigned to the jammers reach -238 dBW when the circular antenna array is assumed to have 30 elements. They also indicate that the stronger the jammer power, the deeper the nulls that can be assigned with this new signal processing structure.
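For reference, the textbook form of the LCMV beamformer is given below. It is the standard formulation and not necessarily the exact variant used in the paper, whose combination with the min-norm algorithm is not detailed in the abstract. The symbols are assumptions introduced here: $\mathbf{w}$ is the array weight vector, $\mathbf{R}$ the covariance matrix of the received array snapshots, $\mathbf{C}$ the constraint matrix whose columns are steering vectors for the constrained directions, and $\mathbf{f}$ the vector of desired responses in those directions.

```latex
\begin{aligned}
\min_{\mathbf{w}} \quad & \mathbf{w}^{H}\mathbf{R}\,\mathbf{w}
\qquad \text{subject to} \qquad \mathbf{C}^{H}\mathbf{w} = \mathbf{f}, \\[4pt]
\mathbf{w}_{\mathrm{LCMV}} \;=\; & \mathbf{R}^{-1}\mathbf{C}
\left(\mathbf{C}^{H}\mathbf{R}^{-1}\mathbf{C}\right)^{-1}\mathbf{f}.
\end{aligned}
```

Because the output power $\mathbf{w}^{H}\mathbf{R}\,\mathbf{w}$ is minimized while the constrained responses are held fixed, strong interferers dominate $\mathbf{R}$ and are suppressed most heavily, which is consistent with the observation that stronger jammers receive deeper nulls.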
-
Shi-Zhao Zhang
2007 Volume 6 Pages S789-S791
Published: 2007
Released on J-STAGE: November 06, 2007
JOURNAL
FREE ACCESS
A new method for digitizing letters and Chinese characters is proposed.
-
Qingzhai Geng, Xingming Zhu, Jianan Cai
2007 Volume 6 Pages S792-S796
Published: 2007
Released on J-STAGE: December 01, 2007
JOURNAL
FREE ACCESS
Scientific data sharing about water resources is an indispensable component of national fundamental data-sharing construction and the science and technology innovation system. Implementing data sharing about water resources, which covers a broad extent and a great variety of data, is a difficult task because of the technical complexity and, more critically, non-technical factors, such as data policy, standards, and sharing circumstances. In this paper, the status of water resources scientific data sharing at home and abroad is analyzed, and some concepts are presented based on these analyses. Using data from the Scientific Data Sharing Project, the authors study the system of data sharing about water resources and propose technological standards and management and service systems for water resources data sharing, for the public as well as for the scientific research sectors of the nation.
-
Jungang Xu, Liang Zhou, Kun Zhang, Wenyao Zhang
2007 Volume 6 Pages S797-S805
Published: 2007
Released on J-STAGE: December 01, 2007
JOURNAL
FREE ACCESS
In this paper, a Bi-angle Model Management method (BiMM) is proposed to manage models in virtual reality research. One angle is based on the model itself, which includes the model, model scheme, and texture; another angle is based on the model sort - each sort has its child sorts except leaf nodes. Based on this method, we have developed a model management application that has the following major functions: model management, model sort management, model query, model statistics, model registration into database as a whole, etc. With this method, researchers can manage model data conveniently and efficiently.
-
Ai-jun Li, Zhi-gang Yin
2007 Volume 6 Pages S806-S812
Published: 2007
Released on J-STAGE: December 01, 2007
JOURNAL
FREE ACCESS
A speech corpus is the basis for analyzing the characteristics of speech signals and developing speech synthesis and recognition systems. In China, almost all speech research and development organizations are developing their own speech corpora. There are so many different kinds of Chinese speech corpora that it is important to be able to share them conveniently, to avoid wasting time and money and to make research work more efficient. The primary goal of this research is to find a standard scheme that allows a corpus to be established more efficiently and to be used or shared more easily. A large speech corpus covering 10 regional accents of Chinese, RASC863 (a Regional Accent Speech Corpus funded by the National 863 Project), is used as an example to illustrate the standardization of speech corpus production.
-
Likang Zhang
2007 Volume 6 Pages S813-S823
Published: 2007
Released on J-STAGE: December 01, 2007
JOURNAL
FREE ACCESS
Focusing on common searches in Chinese, this paper analyzes the web search retrieval mechanism for Chinese words concerning library-related core periodicals and meta-library vocabulary. By counting and sorting the search words that appear on the web in given periods of time, the paper aims to reveal the impact of web search on words related to library and information sciences, as well as its development trend.
-
Deqin Yan, Liping Zhang
2007 Volume 6 Pages S824-S830
Published: 2007
Released on J-STAGE: December 01, 2007
JOURNAL
FREE ACCESS
In this paper, a method of extracting rules from information systems with immune algorithms is proposed. The immune algorithm is designed around a sharing mechanism for rule extraction. The principle of sharing and competing for resources in the sharing mechanism is consistent with the relationship of sharing and rivalry among rules. To extract rules efficiently, a new concept of flexible confidence and rule measurement is introduced. Experiments demonstrate that the proposed method is effective.
-
Liyi Ma, Wanhua Qiu
2007 Volume 6 Pages S831-S836
Published: 2007
Released on J-STAGE: December 01, 2007
JOURNAL
FREE ACCESS
There are massive amounts of process data in the usual course of doing engineering. How to choose and accumulate these data to provide reference for newly-built projects in designing and building is a question that project superintendents face. We propose to construct a knowledge management platform for engineering project management to realize the potential of the accumulated decision-making data and study data classification and knowledge management, using architectural engineering data as an example.
-
Qing-Zhen Han, Yue-Hong Zhao, Hao Wen
2007 Volume 6 Pages S837-S846
Published: 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
The formation of olefin complexes is of potential importance in the separation of olefins. Solvents affect the activation energies, and hence the rates and equilibrium constants, of the complexing reactions, and these effects should be well estimated for industrial practice. The solvent effects on the complexing reaction of propylene and nickel dithiolene, $\mathrm{Ni(S_2C_2H_2)_2 + C_2H_4{=}CH_2 \rightarrow Ni(S_2C_2H_2)_2 \cdot C_2H_4{=}CH_2}$, are studied in this work using density functional theory with B3LYP and an Onsager model. Complete optimizations of all the stationary points are performed in benzene, toluene, tetrahydrofuran, dichloromethane, 1,2-dichloroethane, acetone, ethanol, methanol, 1,2,3-propanetriol, dimethylsulfoxide, and water. The reaction of complexing nickel dithiolene with propylene is a two-step process: the first step coordinates the propylene to S atoms in dithiolene, forming a trans-structural intermediate. The second step then yields the cis-structural product. The activation energy of the first step is higher than that of the second, indicating that the first step is the rate-determining step. The solvents make slight changes in the geometries of the reactants, transition states, intermediates, and products. However, the corresponding molecular dipole moment becomes larger as the solvent polarity increases, which helps accelerate the reaction. Furthermore, the activation energies of the first (or second) step decrease exponentially from 125.0 to 113.0 kJ mol$^{-1}$ (or from 101.8 to 83.43 kJ mol$^{-1}$) when the dielectric constants of the solvents increase from 1.00 to 78.39, while the reaction rates of the first (or second) step increase exponentially from $0.7673 \times 10^{-9}$ to $96.20 \times 10^{-9}$ s$^{-1}$ (or from 0.5503 to 1.038 s$^{-1}$), and the equilibrium constants rapidly increase from 0.5066 to 343.4 l mol$^{-1}$. The sharp variations of activation energies, rate constants, and equilibrium constants appear when the dielectric constant of the solvent lies between 1 and approximately 20, while these variations become mild when the dielectric constant is larger than 20. All of these results demonstrate that the complexing reaction of propylene and nickel dithiolene becomes much easier and faster in polar solvents. The relationship between the equilibrium constant of the complexing reaction $K_{eq}$ and the dielectric constant of the solvent $\varepsilon$ can be presented mathematically as $K_{eq} = A \exp(-\varepsilon/t)$ with correlation parameters A = 378.4 l mol$^{-1}$, B = 350.7 l mol$^{-1}$ and t = 21.17. This relationship may be seen as a reference for solvent selection in olefin separation practice.
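As a rough illustration of how strongly a rate constant depends on the barrier height, the sketch below evaluates the Eyring (transition-state theory) expression for the two first-step activation energies quoted above. The abstract does not state which rate expression or temperature the authors used, so the Eyring form, the temperature of 298.15 K, and the treatment of the quoted activation energy as the free-energy barrier are all assumptions, and the function name `eyring_rate` is hypothetical.

```python
import math

BOLTZMANN = 1.380649e-23      # J/K
PLANCK = 6.62607015e-34       # J s
GAS_CONSTANT = 8.314462618    # J/(mol K)

def eyring_rate(barrier_kj_per_mol, temperature_k=298.15):
    """Eyring rate constant in 1/s, treating the barrier as a free-energy barrier (assumption)."""
    prefactor = BOLTZMANN * temperature_k / PLANCK
    exponent = -barrier_kj_per_mol * 1000.0 / (GAS_CONSTANT * temperature_k)
    return prefactor * math.exp(exponent)

# First-step barriers quoted in the abstract for the lowest and highest dielectric constants.
k_low_polarity = eyring_rate(125.0)
k_high_polarity = eyring_rate(113.0)

print(f"{k_low_polarity:.3e} 1/s")
print(f"{k_high_polarity:.3e} 1/s")
print(f"speed-up factor: {k_high_polarity / k_low_polarity:.1f}")
```

Under these assumptions the computed values have the same order of magnitude as the first-step rates quoted in the abstract, and lowering the barrier by 12 kJ mol$^{-1}$ speeds the step up by roughly two orders of magnitude.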
-
Toshihiro Ashino, Nobuto Oka
2007 Volume 6 Pages S847-S852
Published: 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
Every material database on the Internet has a different data schema. There are some trial attempts to define unified schema for material databases, but since the structure of scientific data is very complicated and changes dynamically, defining a complete data schema is an impossible task. There are two major approaches for material data standards: one is MatDB, an attempt to define precise and detailed metadata, and the other is MatML, which only defines a framework. However, there is a third way: loose syndication, such as blogs with RSS. RSS, RDF Site Summary, or Really Simple Syndication, was developed to summarize document pages, but it can be extended to describe metadata of factual databases. In this presentation, an RSS extension for material database summaries is discussed.
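What an RSS extension for material database summaries might look like can be sketched as follows. This is a hypothetical illustration, not the schema proposed in the presentation: it uses RSS 2.0 for brevity, and the `mat` namespace URI, the element names `property` and `unit`, and the example record are all invented for the example.

```python
import xml.etree.ElementTree as ET

MAT_NS = "http://example.org/material-summary#"  # hypothetical extension namespace
ET.register_namespace("mat", MAT_NS)

def build_feed(entries):
    """Build an RSS 2.0 feed whose items carry extra material-metadata elements."""
    rss = ET.Element("rss", {"version": "2.0"})
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = "Example material database"
    ET.SubElement(channel, "link").text = "http://example.org/materials"
    ET.SubElement(channel, "description").text = "Summaries of newly added material records"
    for entry in entries:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = entry["name"]
        ET.SubElement(item, "link").text = entry["url"]
        # Extension elements live in their own namespace, so plain RSS readers can ignore them.
        ET.SubElement(item, f"{{{MAT_NS}}}property").text = entry["property"]
        ET.SubElement(item, f"{{{MAT_NS}}}unit").text = entry["unit"]
    return ET.tostring(rss, encoding="unicode")

feed_xml = build_feed([
    {
        "name": "Example alloy record",
        "url": "http://example.org/materials/1",
        "property": "thermal conductivity",
        "unit": "W/(m K)",
    }
])
print(feed_xml)
```

The appeal of this loose approach is that each database only has to publish a small summary feed, and aggregators can pick up the extension elements they understand without agreeing on a complete shared schema.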
-
Yun Li, Ai-ping Fu
2007 Volume 6 Pages S853-S856
Published: 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
With the development of computer and network technology, the study of metadata and the standards of metadata have become key research topics in recent years. Metadata design gives us a good tool to help with lexicography. Though it is indispensable for the external metadata of dictionaries, internal metadata design for entry content is even more important. Among these designs, those metadata for heads of character and headwords are still the basic work. These designs provide basic linguistic material and support finishing the work remaining in dictionary compilation. This paper describes a set of metadata of heads of character and headwords with the reference to the Temporary Chinese Dictionary.
-
Dasheng Wang, Lei Liu
2007 Volume 6 Pages S857-S866
Published: December 02, 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
Reading is the main way in which people acquire information; the function of a paper information reading room is to understand public habits, create an environment, and make a "readers' space" to link users with information. In English, the word "information" is used to indicate the Chinese word " ." This word has several meanings in English: news, intelligence, knowledge, report, speech, lecture, notice, service platform, inform, tell, [computer] information. It is clear that "information" is a word with a wide range of connotations. According to Wang (1986), "information is a signal from the universe that is combined with a media substance. Information is from a substance, but it is not the substance where it is from. It is a new substance, a combination of medium and signal. Such change occurred in the twinkling of an eye" (Wang, 2005). In this paper, we discuss the meaning of information and present ideas about how paper-oriented reading rooms have become digital and how digital reading rooms can be extended to cybercafés, TV, radios, and telephones.
-
Cuiping Ge, Jun Zhao, Shaoliang Zhang, Lei Shang, Sheng Yin
2007 Volume 6 Pages
S867-S878
Published: December 02, 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
The comprehensive database system of the Northeast agro-ecology of black soil (CSDB_BL) is user-friendly software designed to store and manage large amounts of agricultural data. The data were collected efficiently and systematically from long-term experiments and observations of black land and from statistical information. The system is based on the ORACLE database management system, and the interface is written in PB language. The database has the following main facilities: (1) runs on Windows platforms; (2) facilitates data entry from *.dbf to ORACLE or creates ORACLE tables directly; (3) has a metadata facility that describes the methods used in the laboratory or in the observations; (4) allows data to be transferred to an expert system for simulation analysis and estimates made by Visual C++ and Visual Basic; (5) can be connected with GIS, so it is easy to analyze changes in land use; and (6) allows metadata and data entities to be shared on the Internet. The following datasets are included in CSDB_BL: long-term experiments and observations of water, soil, climate, and biology; special research projects; a natural resource survey of Hailun County in the 1980s; images from remote sensing; graphs of vectors and grids; and statistics from Northeast China. CSDB_BL can be used in the research and evaluation of agricultural sustainability nationally, regionally, or locally. It can also be used as a tool to assist the government in planning for agricultural development. Expert systems connected with CSDB_BL can give farmers directions for farm planting management.
-
Fenglin Peng, Dan Wang, Xinbao Zheng, Lijun Xing, Keyun Tang, Bangyan ...
2007 Volume 6 Pages
S879-S883
Published: 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
Since the 1990s, our center's geophysics database has been available on the Internet (http://gp.wdc.cn and http://wdc.geophys.cn). Based on HTML, the website offers a simple data service and is constantly being upgraded and improved. We have adopted ORACLE as the database and use JSP (Java Server Pages) technology to create dynamic pages. The basic function of the network is to store, check, query, and update the data. Users can query and download the data in text format and view pictures generated by Matlab. At present the system is running well in both its Chinese and English versions, each having a unified style. The pages are simple and convenient to use. Its users come from most provinces of China, including Taiwan, and from both developed and developing countries, such as the U.S.A., Germany, Japan, Singapore, Bulgaria, and Canada.
-
Long-hua He, Junjie Li
2007 Volume 6 Pages
S884-S888
Published: December 02, 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
Recently, ontology research has received much attention in geo-information science, and the concept of ontology is very important for spatial information concept modeling, data sharing, and the classification of geographical classes. More importantly, it enriches the semantic theory of spatial information. Geo-information services and geo-information interpretation and extraction are the two main applications of geo-ontology. Ontologies have great application potential for geo-information services.
-
Chenyu Li, Kunqing Xie, Xiujun Ma, Cuo Cai, Yanfeng Sun
2007 Volume 6 Pages
S889-S897
Published: December 02, 2007
Released on J-STAGE: December 12, 2007
JOURNAL
FREE ACCESS
Grid computing for resource sharing and distributed computing has been widely researched. For distributed spatial datasets, however, the current centralized administrative scheme may become a system performance bottleneck. This paper presents a distributed cooperative grid computing technology that facilitates complex spatial applications through collaboration among distributed spatial resources. A hierarchical spatial index and a communication protocol have been designed for the collaboration, enabling dynamic selection of the best-quality nodes for specified subtasks, synchronized execution, and compensation when a subtask fails to execute. We also present an approach for dynamic resource allocation and distributed transaction mechanisms to ensure consistency.
-
Jing Fang, Wenqing Liu, Haibo Tan
2007 Volume 6 Pages
S898-S903
Published: December 08, 2007
Released on J-STAGE: December 19, 2007
JOURNAL
FREE ACCESS
The administration and storage of environmental characteristic spectral data are highly relevant in many fields of environmental study, such as the measurement of trace gases in the atmosphere and air quality estimation. For this reason, a web-accessible database has been developed, offering ready access to the main parameters of molecular absorption spectral data. User-friendly web-based interfaces allow interactive queries as well as previews of plots and downloads of files of the resulting spectral data for thorough comparative analyses.
-
Yun-liang Zhang, Quan Zhang
2007 Volume 6 Pages
S904-S909
Published: December 15, 2007
Released on J-STAGE: December 19, 2007
JOURNAL
FREE ACCESS
Text mining deals with complex and unstructured texts. Usually, a particular collection of texts that is specific to one or more domains is necessary. We have developed a customizable text classifier that lets users mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by the user. The user can also control the number of domains chosen and decide the standard by which texts are chosen, based on demand and the abundance of materials. The performance of the classifier varies with the user's choice.
-
Liya Li, Yang Song, Qiumei Guo
2007 Volume 6 Pages
S910-S912
Published: December 15, 2007
Released on J-STAGE: December 19, 2007
JOURNAL
FREE ACCESS
Scientific data sharing is a long-term and complicated task, and the related data sharing and distribution policies are prime concerns. Drawing on both domestic and international experience in scientific data sharing, this paper discusses the sources, distribution, and classification of scientific data in advanced manufacturing and automation. A preliminary data sharing and distribution policy for advanced manufacturing and automation is introduced.
-
Xiaogang Li, Jin Gao, Chaofang Dong, Cuiwei Du, Degui Luo, Lin Lu
2007 Volume 6 Pages
S913-S925
Published: December 15, 2007
Released on J-STAGE: December 19, 2007
JOURNAL
FREE ACCESS
This article discusses the key features of a newly developed national data-sharing online network for material environmental corrosion. Written in Java and based on Oracle database technology, the central database in the network is supported by two unique series of corrosion failure data, both of which were accumulated over a long period of time. The first category of data, provided by national environment corrosion test sites, is corrosion failure data for different materials in typical environments (atmosphere, seawater, and soil). The other category is corrosion data in production environments, provided by a variety of firms. This network system enables standardized management of environmental corrosion data, an effective data sharing process, and research and development support for new products and after-sale services. Moreover, this network system provides a firm base and data-service platform for the evaluation of project bids, safety, and service life. This article also discusses issues including data quality management and evaluation in the material corrosion data sharing process, access authority of different users, compensation for providers of shared historical data, and finally, the related policy and legal processes, which are required to protect the intellectual property rights of the database.
-
Jian Tan, Liang Gao, Yingchao Ren, Lei Wu, Fuqing Zhang, Lei Sheng, Ch ...
2007 Volume 6 Pages
S926-S929
Published: December 15, 2007
Released on J-STAGE: December 19, 2007
JOURNAL
FREE ACCESS
We present recent research focused on the construction of a Web GIS-based GPS vehicle monitoring system to locate or navigate thousands of vehicles simultaneously.
-
Ling Bao
2007 Volume 6 Pages
S930-S940
Published: December 23, 2007
Released on J-STAGE: December 31, 2007
JOURNAL
FREE ACCESS
The Overseas Sinology Database is composed of three databases: scholar, organization, and journal. The thesis database is regarded as separate and is attached to the scholar database. The database information comes from major areas of the world, especially the countries adjacent to China, and is updated continuously. The Sinology Database is in several different languages and should satisfy the differing needs of data collection and database application. Data quality is strictly controlled during the whole data life cycle, which includes data collection, processing, storage, and access. In addition, metadata are created to accompany the data according to the metadata standards and specifications, which supports cooperation among different databases. Finally, besides the functions of searching, statistical calculation, and sorting, the database is also used for data mining and knowledge discovery. Through these methods, conclusions about changes in Sinology can be drawn, which will aid us in understanding the world and China in particular.
-
Linbo Jing
2007 Volume 6 Pages
S941-S961
Published: December 23, 2007
Released on J-STAGE: December 31, 2007
JOURNAL
FREE ACCESS
This article begins with the digital divide. It gives a brief overview of the digital divide on a global basis and analyzes specific aspects of the digital divide in our country. It also introduces the informationization construction of Guizhou Province and points out problems with the digital divide in that province. It then focuses on how Guizhou Province has worked to bridge the digital divide: the practice and experience of GZNW. The final section gives a series of policy recommendations on how to bridge the digital divide, realize digital dividends, and build a new socialist countryside.
-
Yuxian Wu
2007 Volume 6 Pages
S962-S967
Published: December 23, 2007
Released on J-STAGE: December 31, 2007
JOURNAL
FREE ACCESS
Historically, data development (collection, processing, delivery, storage, and use) has had its own modes, progressing from the manual mode through the mechanical and electronic modes to today's network mode. A search engine combined with self-learning is the advanced mode of data development. The network mode has also been changing, and the underlying motivation lies in the development and progress of the Internet itself. Two major trends are forcing the mode of data development to face new challenges and make decisions. One is the revolution resulting from changes in user market needs, represented by Web 2.0. The other is the revolution resulting from technological development, represented by the Grid. Squeezed by these two trends from opposite directions, the lagging, crude, and inefficient mode will change radically, forced either by wise decision-making or by the silent market. For the data-development mode, changes in technology and operation require simultaneous changes in the rules of the game. Eliminating barriers, promoting resource sharing, and rationalizing market/non-market relations are therefore major and unavoidable tasks.
-
Jili Cheng
2007 Volume 6 Pages
S968-S972
Published: December 23, 2007
Released on J-STAGE: December 31, 2007
JOURNAL
FREE ACCESS
Philosophy research used to rely mainly on traditionally published journals and newspapers for collecting or communicating data. However, because of financial limits or a lack of capability to collect data, required published materials, restricted materials, and developing information from research projects often could not be obtained. The rise of digital techniques and the Internet has made it possible to share data resources for philosophy research. However, although there are several ICPs with large-scale comprehensive commercial databases in the field in China, no real non-profit professional database for philosophy researchers exists. Therefore, in 2002, the Philosophy Institute of the Chinese Academy of Social Sciences began a project to build "The Database of Philosophy Research." By March 2006, the number of subsets had reached 30, with more than 30,000 records; retrieval services had reached 6,000 and article readings 30,000. Because of intellectual property considerations, the service of the database is currently limited to the information held in CASS. Nevertheless, this is the first academic database for philosophy research, so its orientation is towards resource sharing, leading users to data, and serving a large number of requests from other provinces and departments.
-
Paata J. Kervalishvili
2007 Volume 6 Pages
S973-S980
Published: December 23, 2007
Released on J-STAGE: December 31, 2007
JOURNAL
FREE ACCESS
-
Hewen Tang, Yongsheng Cao
2007 Volume 6 Pages
S981-S985
Published: December 23, 2007
Released on J-STAGE: December 31, 2007
JOURNAL
FREE ACCESS
Data mining is a technique used to uncover previously unknown and potentially useful knowledge from large datasets. It turns data into actionable information for better decision making. Delphi incorporates a collection of high-performance data-analysis components known as the Decision Cube, which offers an easy path to powerful data analysis in an application. This paper describes a simple approach to data mining using the Decision Cube components of Delphi. It requires only some dragging and dropping and property setting, with no need for manual coding or special knowledge of data mining. An example is offered at the end.