Data Science Journal

Contents of Volume 9, 2010

Papers

Discovering Imperceptible Associations Based on Interestingness: A Utility-Oriented Data Mining

S. Shankar, T. Purusothaman

2010 Volume 9 Pages 1-12
Published: February 24, 2010
Released on J-STAGE: February 24, 2010
Advance online publication: February 12, 2010

DOIhttps://doi.org/10.2481/dsj.008-030

JOURNAL FREE ACCESS

Show abstractHide abstract

This article proposes an innovative utility sentient approach for the mining of interesting association patterns from transaction databases. First, frequent patterns are discovered from the transaction database using the FP-Growth algorithm. From the frequent patterns mined, this approach extracts novel interesting association patterns with emphasis on significance, utility, and the subjective interests of the users. The experimental results portray the efficiency of this approach in mining utility-oriented and interesting association rules. A comparative analysis is also presented to illustrate our approach's effectiveness.

View full abstract

Download PDF (1132K)
Applying Statistical Design to Control the Risk of Over-Design with Stochastic Simulation

Yi Wu, Peng Zhou, Jian Lin, Wanhua Qiu

2010 Volume 9 Pages 13-28
Published: March 12, 2010
Released on J-STAGE: March 12, 2010
Advance online publication: February 28, 2010

DOIhttps://doi.org/10.2481/dsj.008-003

JOURNAL FREE ACCESS

Show abstractHide abstract

By comparing a hard real-time system and a soft real-time system, this article elicits the risk of over-design in soft real-time system designing. To deal with this risk, a novel concept of statistical design is proposed. The statistical design is the process accurately accounting for and mitigating the effects of variation in part geometry and other environmental conditions, while at the same time optimizing a target performance factor. However, statistical design can be a very difficult and complex task when using clas-sical mathematical methods. Thus, a simulation methodology to optimize the design is proposed in order to bridge the gap between real-time analysis and optimization for robust and reliable system design.

View full abstract

Download PDF (1104K)
An IDA-Based Parallel Storage Scheme in the Scientific Data Grid

Weizhong Lu, Yuanchun Zhou, Lei Liu, Baoping Yan

2010 Volume 9 Pages 29-41
Published: May 26, 2010
Released on J-STAGE: May 26, 2010
Advance online publication: May 19, 2010

DOIhttps://doi.org/10.2481/dsj.009-006

JOURNAL FREE ACCESS

Show abstractHide abstract

It is important to improve data reliability and data access efficiency for data-intensive applications in a data grid environment. In this paper, we propose an Information Dispersal Algorithm (IDA)-based parallel storage scheme for massive data distribution and parallel access in the Scientific Data Grid. The scheme partitions a data file into unrecognizable blocks and distributes them across many target storage nodes according to user profile and system conditions. A subset of blocks, which can be downloaded in parallel to remote clients, is required to reconstruct the data file. This scheme can be deployed on the top of current grid middleware. A demonstration and experimental analysis show that the IDA-based parallel storage scheme has better data reliability and data access performance than the existing data replication methods. Furthermore, this scheme has the potential to reduce considerably storage requirements for large-scale databases on a data grid.

View full abstract

Download PDF (989K)
Web Syndication Approaches for Sharing Primary Data in "Small Science" Domains

Eric C. Kansa, Ahrash Bissell

2010 Volume 9 Pages 42-53
Published: July 08, 2010
Released on J-STAGE: July 08, 2010
Advance online publication: June 29, 2010

DOIhttps://doi.org/10.2481/dsj.009-012

JOURNAL FREE ACCESS

Show abstractHide abstract

In some areas of science, sophisticated web services and semantics underlie "cyberinfrastructure". However, in "small science" domains, especially in field sciences such as archaeology, conservation, and public health, datasets often resist standardization. Publishing data in the small sciences should embrace this diversity rather than attempt to corral research into "universal" (domain) standards. A growing ecosystem of increasingly powerful Web syndication based approaches for sharing data on the public Web can offer a viable approach. Atom Feed based services can be used with scientific collections to identify and create linkages across different datasets, even across disciplinary boundaries without shared domain standards.

View full abstract

Download PDF (920K)
Materials Ontology: An Infrastructure for Exchanging Materials Information and Knowledge

Toshihiro Ashino

2010 Volume 9 Pages 54-61
Published: July 08, 2010
Released on J-STAGE: July 08, 2010
Advance online publication: June 26, 2010

DOIhttps://doi.org/10.2481/dsj.008-041

JOURNAL FREE ACCESS

Show abstractHide abstract

We have rich information resources for materials science and engineering - raw measurement data, computational simulation methods, digitized handbooks, and digital libraries. However, these resources have a wide variety of formats, terminologies, and concepts, which makes it difficult to find appropriate information for materials design, development, and evaluation. One solution to this problem is to integrate these resources into a computer readable concept map, called a domain ontology, which describes concepts and relationships among the concepts in materials science and engineering. This paper describes a trial that constructs a standard of metadata description using ontology language and demonstrates the validity of this construction through data exchange among heterogeneous material databases. "Materials Ontology," which consists of several sub ontologies corresponding to substance, process, environment, and property, is developed using the ontology language of the Semantic Web, OWL, which enables the definition of a flexible and detailed structure of materials information. A versatile "materials data format" is built on the Materials Ontology as a component of the materials information platform and is applied to exchange data among three different thermal property databases, maintained by two major materials science research institutes in Japan.

View full abstract

Download PDF (1836K)
Utility Independent Privacy Preserving Data Mining - Horizontally Partitioned Data

E Poovammal, M Ponnavaikko

2010 Volume 9 Pages 62-72
Published: July 17, 2010
Released on J-STAGE: July 17, 2010
Advance online publication: June 23, 2010

DOIhttps://doi.org/10.2481/dsj.008-040

JOURNAL FREE ACCESS

Show abstractHide abstract

Micro data is a valuable source of information for research. However, publishing data about individuals for research purposes, without revealing sensitive information, is an important problem. The main objective of privacy preserving data mining algorithms is to obtain accurate results/rules by analyzing the maximum possible amount of data without unintended information disclosure. Data sets for analysis may be in a centralized server or in a distributed environment. In a distributed environment, the data may be horizontally or vertically partitioned. We have developed a simple technique by which horizontally partitioned data can be used for any type of mining task without information loss. The partitioned sensitive data at 'm' different sites are transformed using a mapping table or graded grouping technique, depending on the data type. This transformed data set is given to a third party for analysis. This may not be a trusted party, but it is still allowed to perform mining operations on the data set and to release the results to all the 'm' parties. The results are interpreted among the 'm' parties involved in the data sharing. The experiments conducted on real data sets prove that our proposed simple transformation procedure preserves one hundred percent of the performance of any data mining algorithm as compared to the original data set while preserving privacy.

View full abstract

Download PDF (646K)
A Proposal on Using Reuse Readiness Levels to Measure Software Reusability

Robert R. Downs, James J. Marshall

2010 Volume 9 Pages 73-92
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: July 17, 2010

DOIhttps://doi.org/10.2481/dsj.009-007

JOURNAL FREE ACCESS

Show abstractHide abstract

The use of scientific data is becoming increasingly dependent on the software that fosters such use. As the ability to reuse software contributes to capabilities for reusing software-dependent data, instruments for measuring software reusability contribute to the reuse of software and related data. The development and current state of a proposed set of Reuse Readiness Levels (RRLs) are summarized, and potential uses of the software reusability measures are described, along with proposed use cases to support sponsorship of software projects, software production, software adoption, and data stewardship during the systems development lifecycle and the data lifecycle.

View full abstract

Download PDF (824K)
An Alternative Ratio-cum-Product Estimator of Population Mean Using a Coefficient of Kurtosis for Two Auxiliary Variates

Rajesh Tailor, Med Ram Verma, Balkishan Sharma

2010 Volume 9 Pages 93-99
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: June 29, 2010

DOIhttps://doi.org/10.2481/dsj.008-034

JOURNAL FREE ACCESS

Show abstractHide abstract

An alternative ratio-cum-product estimator of population mean using the coefficient of kurtosis for two auxiliary variates has been proposed. The proposed estimator has been compared with a simple mean estimator, the usual ratio estimator, a product estimator, and estimators proposed by Singh (1967) and Singh et al. (2004). An empirical study is also carried out in support of the theoretical findings.

View full abstract

Download PDF (559K)
Data Management Activities of Canada's National Science Library - 2010 Update and Prospective

Mary Zborowski

2010 Volume 9 Pages 100-106
Published: January 25, 2011
Released on J-STAGE: January 25, 2011
Advance online publication: January 13, 2011

DOIhttps://doi.org/10.2481/dsj.009-026

JOURNAL FREE ACCESS

Show abstractHide abstract

NRC-CISTI serves Canada as its National Science Library (as mandated by Canada's Parliament in 1924) and also provides direct support to researchers of the National Research Council of Canada (NRC). By reason of its mandate, vision, and strategic positioning, NRC-CISTI has been rapidly and effectively mobilizing Canadian stakeholders and resources to become a lead player on both the Canadian national and international scenes in matters relating to the organization and management of scientific research data. In a previous communication (CODATA International Conference, 2008), the orientation of NRC-CISTI towards this objective and its short- and medium-term plans and strategies were presented. Since then, significant milestones have been achieved. This paper presents NRC-CISTI's most recent activities in these areas, which are progressing well alongside a strategic organizational redesign process that is realigning NRC-CISTI's structure, mission, and mandate to better serve its clients. Throughout this transformational phase, activities relating to data management remain vibrant.

View full abstract

Download PDF (554K)
A System for Ontology-Based Sharing of Expert Knowledge in Sustainability Science

Steven Kraines, Weisen Guo

2011 Volume 9 Pages 107-123
Published: January 29, 2011
Released on J-STAGE: January 29, 2011
Advance online publication: January 13, 2011

DOIhttps://doi.org/10.2481/dsj.Kraines

JOURNAL FREE ACCESS

Show abstractHide abstract

Work towards creation of a knowledge sharing system for sustainability science through the application of semantic data modeling is described. An ontology grounded in description logics was developed based on the ISO 15926 data model to describe three types of sustainability science conceptualizations: situational knowledge, analytic methods, and scenario frameworks. Semantic statements were then created using this ontology to describe expert knowledge expressed in research proposals and papers related to sustainability science and in scenarios for achieving sustainable societies. Semantic matching based on logic and rule-based inference was used to quantify the conceptual overlap of semantic statements, which shows the semantic similarity of topics studied by different researchers in sustainability science, similarities that might be unknown to the researchers themselves.

View full abstract

Download PDF (2318K)
A Privacy-Preserving Data Mining Method Based on Singular Value Decomposition and Independent Component Analysis

Guang Li, Yadong Wang

2011 Volume 9 Pages 124-132
Published: February 16, 2011
Released on J-STAGE: February 16, 2011
Advance online publication: February 08, 2011

DOIhttps://doi.org/10.2481/dsj.009-025

JOURNAL FREE ACCESS

Show abstractHide abstract

Privacy protection is indispensable in data mining, and many privacy-preserving data mining (PPDM) methods have been proposed. One such method is based on singular value decomposition (SVD), which uses SVD to find unimportant information for data mining and removes it to protect privacy. Independent component analysis (ICA) is another data analysis method. If both SVD and ICA are used, unimportant information can be extracted more comprehensively. Accordingly, this paper proposes a new PPDM method using both SVD and ICA. Experiments show that our method performs better in preserving privacy than the SVD-based methods while also maintaining data utility.

View full abstract

Download PDF (350K)

"Proceedings of the International Symposium: Fifty Years after IGY - Modern Information Technologies and Earth and Solar Sciences -" (Eds. Iyemori, T. et al.) Part 2

Astronomical Virtual Observatories Through International Collaboration

Masatoshi Ohishi

2010 Volume 9 Pages S128-S134
Published: March 13, 2010
Released on J-STAGE: March 13, 2010
Advance online publication: March 10, 2010

DOIhttps://doi.org/10.2481/dsj.IGY-050

JOURNAL FREE ACCESS

Show abstractHide abstract

Astronomical Virtual Observatories (VOs) are emerging research environment for astronomy, and 16 countries and a region have funded to develop their VOs based on international standard protocols for interoperability. The 16 funded VO projects have established the International Virtual Observatory Alliance (http://www.ivoa.net/) to develop the standard interoperable interfaces such as registry (meta data), data access, query languages, output format (VOTable), data model, application interface, and so on. The IVOA members have constructed each VO environment through the IVOA interfaces. National Astronomical Observatory of Japan (NAOJ) started its VO project (Japanese Virtual Observatory - JVO) in 2002, and developed its VO system. We have succeeded to interoperate the latest JVO system with other VOs in the USA and Europe since December 2004. Observed data by the Subaru telescope, satellite data taken by the JAXA/ISAS, etc. are connected to the JVO system. Successful interoperation of the JVO system with other VOs means that astronomers in the world will be able to utilize top-level data obtained by these telescopes from anywhere in the world at anytime. System design of the JVO system, experiences during our development including problems of current standard protocols defined in the IVOA, and proposals to resolve these problems in the near future are described.

View full abstract

Download PDF (1379K)
The "GDSClient" Collecting Tool for Networked Solid Earth Science Data

H. Nagao, S. Tsuboi, Y. Ishihara, H. Yanaka

2010 Volume 9 Pages S135-S139
Published: March 28, 2010
Released on J-STAGE: March 28, 2010
Advance online publication: March 21, 2010

DOIhttps://doi.org/10.2481/dsj.IGY-051

JOURNAL FREE ACCESS

Show abstractHide abstract

The data center of our institute distributes solid earth science data obtained by the Ocean Hemisphere Project (OHP) network through the website of Pacific 21. We have developed Java-based software "GDSClient", which enables us to collect not only the data of the OHP network but also those distributed from other data centers by means of the web service technology. It is possible to request the data controlling parameters such as data centers, observatories, a data period, and other auxiliary detailed parameters. It is unnecessary to know differences between data centers with preparing a WSDL (Web Services Description Language) file, in which information of user interface is described in XML format. The latest GDSClients are released from the website of Pacific 21.

View full abstract

Download PDF (465K)

Special Issue

Information Technology Challenges in Earth and Solar Sciences (Part 2)

SPASE and the Heliophysics Virtual Observatories

J. R. Thieman, D. A. Roberts, T. A. King, C. C. Harvey, C. H. Perry, P ...

2010 Volume 9 Pages IGY85-IGY93
Published: March 06, 2010
Released on J-STAGE: March 06, 2010
Advance online publication: February 25, 2010

DOIhttps://doi.org/10.2481/dsj.SS_IGY-019

JOURNAL FREE ACCESS

Show abstractHide abstract

The Space Physics Archive Search and Extract (SPASE) project has developed an information model for interoperable access and retrieval of data within the Heliophysics (also known as space and solar physics) science community. The diversity of science data archives within this community has led to the establishment of many virtual observatories to coordinate the data pathways within Heliophysics subdisciplines, such as magnetospheres, waves, radiation belts, etc. The SPASE information model provides a semantic layer and common language for data descriptions so that searches might be made across the whole of the heliophysics data environment, especially through the virtual observatories.

View full abstract

Download PDF (717K)
Data Practices, Policy, and Rewards in the Information Era Demand a New Paradigm

C. Barton, R. Smith, R. Weaver

2010 Volume 9 Pages IGY95-IGY99
Published: March 12, 2010
Released on J-STAGE: March 12, 2010
Advance online publication: February 26, 2010

DOIhttps://doi.org/10.2481/dsj.SS_IGY-003

JOURNAL FREE ACCESS

Show abstractHide abstract

As informatics becomes embedded in the scientific method, workload shifts from the user to the provider of data and information services and systems. Yet there is little incentive for research scientists to devote time to data management and system development. Our reward system can be adjusted to encourage responsible data management and open access practices, as well as motivate people to develop systems and services for the common good. At the same time, the status and professional infrastructure for those engaged in informatics needs to match traditional scientific and technical disciplines and create an attractive, competitive career path. Five readily achievable steps can be taken to redress these imbalances.

View full abstract

Download PDF (739K)
Long-Term Seafloor Electromagnetic Observation in the Northwest Pacific May Detect the Vector Geomagnetic Secular Variation

H. Toh, Y. Hamano, T. Goto, H. Utada

2010 Volume 9 Pages IGY100-IGY109
Published: March 12, 2010
Released on J-STAGE: March 12, 2010
Advance online publication: March 06, 2010

DOIhttps://doi.org/10.2481/dsj.SS_IGY-004

JOURNAL FREE ACCESS

Show abstractHide abstract

Sea Floor ElectroMagnetic Stations (SFEMSs) are now operating at two deep seafloor sites called the 'WPB' and the 'NWP' in the West Philippine Basin and the Northwest Pacific Basin, respectively. One of the main objectives of the SFEMSs is to detect the geomagnetic secular variations on the deep seafloor where long-term geomagnetic observations have not so far been achieved. SFEMSs can measure the absolute geomagnetic total force as well as the geomagnetic vector field with precise attitude monitoring systems. The vector geomagnetic time-series that was observed for more than 5 years revealed that the westward drift of the equatorial dipole dominates in the geomagnetic secular variation at the NWP.

View full abstract

Download PDF (2578K)
An Extended Dynamic Model of a Geyser Induced by an Inflow of Gas (2):

Effects of Various Shapes and Repeated Expansions and Contractions in an Underground Watercourse

H. Kagami

2010 Volume 9 Pages IGY110-IGY120
Published: May 02, 2010
Released on J-STAGE: May 02, 2010
Advance online publication: April 16, 2010

DOIhttps://doi.org/10.2481/dsj.SS_IGY-006

JOURNAL FREE ACCESS

Show abstractHide abstract

We modified further our extended dynamic model of a geyser induced by an inflow of gas, by taking into consideration the effects during spouting of an elbow shape, pairs of sudden expansions and contractions, and repeats of this shape in an underground watercourse. Through numerical simulations of this extended dynamic model, we see that a large number of sudden expansions and contractions or a large angular elbow in the underground watercourse greatly affects the spouting dynamics of the geyser.

View full abstract

Download PDF (629K)

CRIS for European e-Infrastructure

1 Introduction

CERIF-CRIS for the European e-Infrastructure

K. Jeffery, A. Asserson

2010 Volume 9 Pages CRIS1-CRIS6
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: April 29, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS1

JOURNAL FREE ACCESS

Show abstractHide abstract

The European e-infrastructure is the ICT support for research although the infrastructure will be extended for commercial/business use. It supports the research process across funding agencies to research institutions to innovation. It supports experimental facilities, modelling and simulation, communication between researchers, and workflow of research processes and research management. We propose the core should be CERIF: an EU recommendation to member states for exchanging research information and for homogeneous access to heterogeneous information. CERIF can also integrate associated systems (such as finance, human resource, project management, and library services) and provides interoperation among research institutions, research funders, and innovators.

View full abstract

Download PDF (1500K)

2 The need for a CRIS. Structure and Use of a CRIS - The Common European Research Information Format Model (CERIF)

The CERIF Model As the Core of a Research Organization

Keith Jeffery

2010 Volume 9 Pages CRIS7-CRIS13
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: April 29, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS2

JOURNAL FREE ACCESS

Show abstractHide abstract

A CERIF-CRIS consists of base entities with records describing components of the research and link entities describing relationships among records in the base entities. As an example, three base entities may contain records describing a person, a publication and a project while two link entities relate respectively the person to the publication in role author and the person to the project in role project leader. This powerful linking or inter-relating capability includes temporal as well as role aspects and inter-relates dynamically and flexibly all the components of R&D. The CERIF model can be extended to inter-relate appropriate information from legacy information systems in an organisation, such as those covering accounting, human resources, project management, assets, stock control, etc. A CERIF-CRIS can thus provide a flexible low-cost integration comparable with an ERP (Enterprise Resource Planning) System, particularly in an organisation with R&D as its primary business.

View full abstract

Download PDF (1289K)
CRIS and Institutional Repositories

A. Asserson, K. Jeffery

2010 Volume 9 Pages CRIS14-CRIS23
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: April 30, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS3

JOURNAL FREE ACCESS

Show abstractHide abstract

CRIS (Current Research Information Systems) provide researchers, research managers, innovators, and others with a view over the research activity of a domain. IRs (institutional repositories) provide a mechanism for an organisation to showcase through OA (open access) its intellectual property. Increasingly, organizations are mandating that their employed researchers deposit peer-reviewed published material in the IR. Research funders are increasingly mandating that publications be deposited in an open access repository: some mandate a central (or subject-based) repository, some an IR. In parallel, publishers are offering OA but replacing subscription-based access with author (or author institution) payment for publishing. However, many OA repositories have metadata based on DC (Dublin Core) which is inadequate; a CERIF (Common-European Research Information Format) CRIS provides metadata describing publications with formal syntax and declared semantics thus facilitating interoperation or homogeneous access over heterogeneous sources. The formality is essential for research output metrics, which are increasingly being used to determine future funding for research organizations.

View full abstract

Download PDF (1150K)

3 How to Set up and Use a CERIF-CRIS

CERIF: The Common European Research Information Format Model

Brigitte Jörg

2010 Volume 9 Pages CRIS24-CRIS31
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: June 29, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS4

JOURNAL FREE ACCESS

Show abstractHide abstract

With increased computing power more data than ever are being and will be produced, stored and (re-) used. Data are collected in databases, computed and annotated, or transformed by specific tools. The knowledge from data is documented in research publications, reports, presentations, or other types of files. The management of data and knowledge is difficult, and even more complicated is their re-use, exchange, or integration. To allow for quality analysis or integration across data sets and to ensure access to scientific knowledge, additional information - Research Information - has to be assigned to data and knowledge entities. We present the metadata model CERIF to add information to entities such as Publication, Project, Organisation, Person, Product, Patent, Service, Equipment, and Facility and to manage the semantically enhanced relationships between these entities in a formalized way. CERIF has been released as an EC Recommendation to European Member States in 2000. Here, we refer to the latest version CERIF 2008-1.0.

View full abstract

Download PDF (795K)
CERIF Best Practice

Anne Asserson

2010 Volume 9 Pages CRIS32-CRIS38
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: April 29, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS5

JOURNAL FREE ACCESS

Show abstractHide abstract

CRISs (Current Research Information Systems) are becoming increasingly important for organizations that are related to research, such as funding organisations, universities, and ministries. A CRIS holds information on research activities, results of research, and competence. A CRIS is useful for assessing a person or department, to show the institution's activity, to monitor scholarly activities, and as a base for the development of research strategy. This could be from a local CRIS, national CRIS, or from interoperable CRISs. A CRIS will be really useful if it is structured and can interoperate with other CRISs. The CERIF model (Current European Research Information Model) is a structured model and is able to give statistics for planning, evaluation, and assessment within an institution or benchmarking among institutions. The CERIF CRISs are able to give multiple views, such as a researcher's CV and an overview of an institution's projects (ongoing or ended) with project partners on an organizational or personal level. The output publications of a project are given for an individual researcher or institution, with linkage to the full text (in the local repository) and a list of journals where researchers or organizations are publishing, events, and an annual report on an individual researcher. A CERIF CRIS is recommended by the EU for interoperability among CRISs. A CERIF provides a one stop shop for users and gives uniform access to full text publications and scientific data. A partial model for people, organisation, and results, not projects, can be used. It is recommended, however, to implement the full model. To secure consistent information, it is also recommended to establish authority lists for people (unique ID, name, organization, position, age, sex, etc.) organsations (name, acronym, address, etc.), journals (title, acronym, publisher, URL, etc), and books (publisher, acronym, address, county, etc.) in the CERIF CRIS.

View full abstract

Download PDF (1295K)

4 CRIS and the European e-Infrastructure, Enabling European Research

The ESFRI Roadmap and Its Demands on the e-Infrastructure

K. Jeffery

2010 Volume 9 Pages CRIS39-CRIS43
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: April 29, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS6

JOURNAL FREE ACCESS

Show abstractHide abstract

The ESFRI Roadmap marked a turning point in the evolution of European thinking on research facilities, providing a catalogue of such facilities with their characteristics. In parallel, the ESF (European Science Foundation) completed a questionnaire-based survey of research facilities. Finally, the ERF (European Research Facilities) consortium representing national facilities with international access was formed to parallel EIROForum (the European laboratories funded by international subscriptions). It is becoming increasingly clear that management of these facilities and management of the research process require extensive ICT: for research managers that is provided by CRIS (Current Research Information Systems) and to give researchers additionally access to facilities to control experiments with associated modelling and simulation and access to research datasets and software.

View full abstract

Download PDF (981K)
CRIS and the GRIDs Architecture

K. Jeffery

2010 Volume 9 Pages CRIS44-CRIS52
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: April 29, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS9

JOURNAL FREE ACCESS

Show abstractHide abstract

The end-user demands low effort threshold access to systems providing e-information, e-business, and e-entertainment. Innovators and entrepreneurs require also equally low-energy access to heterogeneous information homogenised to a form and language familiar to them. On top of that, decision-makers, whether in a control room or government strategic planning, demand equally easy access to information that is statistically or inductively enhanced to knowledge and access to modelling or simulation systems to allow 'what if?' requests. Researchers and technical workers have an additional requirement for rapid integration of information with statistical, induction, modelling, and simulation systems to generate and verify hypotheses so generating data and information, to be used by others, which in turn advances knowledge. Access is required, and can now be provided, anytime, anyhow, anywhere through ambient computing technology. A new paradigm, GRIDs, provides the architectural framework.

View full abstract

Download PDF (1083K)

5 Using a CRIS for e-Infrastructure

e-Infrastructure, Science Data and CRIS

S. C. Lambert

2010 Volume 9 Pages CRIS53-CRIS58
Published: July 23, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: May 11, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS7

JOURNAL FREE ACCESS

Show abstractHide abstract

Scientific research is supported by infrastructure, and e-infrastructure is one part of this. Repositories of data are a part of the e-infrastructure and have their own particular needs arising from the requirement for permanence of their data holdings. There are many threats to permanence, and there is a growing awareness of these threats and how they may be countered. Current Research Information Systems and other support to the research lifecycle, while focused on facilitating research activities in the present, will have a role in the preservation of the outputs of research into the future.

View full abstract

Download PDF (582K)
Using a CRIS for e-Infrastructure: e-Infrastructure for Scholarly Publications

E. Dijk, M. van Meel

2010 Volume 9 Pages CRIS59-CRIS65
Published: July 24, 2010
Released on J-STAGE: July 24, 2010
Advance online publication: May 03, 2010

DOIhttps://doi.org/10.2481/dsj.CRIS8

JOURNAL FREE ACCESS

Show abstractHide abstract

Scholarly publications are a major part of the research infrastructure. One way to make output available is to store the publications in Open Access Repositories (OAR). A Current Research Information System (CRIS) that conforms to the standard CERIF (Common European Research Information Format) could be a key component in the e-infrastructure. A CRIS provides the structure and makes it possible to interoperate the CRIS metadata at every stage of the research cycle. The international DRIVER projects are creating a European repository infrastructure. Knowledge Exchange has launched a project to develop a metadata exchange format for publications between CRIS and OAR systems.

View full abstract

Download PDF (771K)

Essay

Information Gulags, Intellectual Straightjackets, and Memory Holes:

Three Principles to Guide the Preservation of Scientific Data

Paul F. Uhlir

2010 Volume 9 Pages ES1-ES5
Published: October 07, 2010
Released on J-STAGE: October 07, 2010
Advance online publication: September 23, 2010

DOIhttps://doi.org/10.2481/dsj.Essay-001-Uhlir

JOURNAL FREE ACCESS

Download PDF (496K)

Meeting Reports

Report from the 4^th Workshop on Extremely Large Databases

Jacek Becla, Kian-Tat Lim, Daniel Liwei Wang

2010 Volume 9 Pages MR1-MR8
Published: March 01, 2011
Released on J-STAGE: March 01, 2011
Advance online publication: February 22, 2011

DOIhttps://doi.org/10.2481/dsj.xldb10

JOURNAL FREE ACCESS

Show abstractHide abstract

Academic and industrial users are increasingly facing the challenge of petabytes of data, but managing and analyzing such large data sets still remains a daunting task. The 4^th Extremely Large Databases workshop was organized to examine the needs of communities under-represented at the past workshops facing these issues. Approaches to big data statistical analytics as well as emerging opportunities related to emerging hardware technologies were also debated. Writable extreme scale databases and the science benchmark were discussed. This paper is the final report of the discussions and activities at this workshop.

View full abstract

Download PDF (637K)

Errata

Erratum: CERIF: The Common European Research Information Format Model [Data Science Journal, Volume 9, 24 July 2010 CRIS24-CRIS31]

Brigitte Jörg

2010 Volume 9 Pages ECRIS1-ECRIS4
Published: August 13, 2010
Released on J-STAGE: August 13, 2010

DOIhttps://doi.org/10.2481/dsj.9_E1

JOURNAL FREE ACCESS

Show abstractHide abstract

The following are errata for the original article entitled "CERIF: The Common European Research Information Format Model" by Brigitte Jörg.

View full abstract

Download PDF (281K)

Register with J-STAGE for free!