JSAI Technical Report, Type 2 SIG

Which sense does an onomatopoeia belong to?

Tetsuaki NAKAMURA, Mai MIYABE, Eiji ARAMAKI

Article type: SIG paper
2013 Volume 2013 Issue AM-04 Pages 01-
Published: July 29, 2013
Released on J-STAGE: August 28, 2021

DOI: https://doi.org/10.11517/jsaisigtwo.2013.AM-04_01

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

This study aims to develop a system that visualizes subjective information. Focusing on onomatopoeias as one form of such information, we estimate which senses an onomatopoeia belongs to among "touch", "taste", "smell", "hearing", "sight", "pleasure (positive)" and "unpleasure (negative)". For this purpose, we use a machine learning method (Support Vector Machine) whose features are the phonetic symbols in the onomatopoeia and their numbers of occurrences. The evaluation experiment shows that (1) the best performance is achieved for "hearing" and "sight", and (2) the performance of the classifier is similar to that of humans. Finally, we propose a system that creates city maps displaying the distribution of subjective information for each sense.
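
As a rough illustration of the kind of feature representation the abstract describes, the following Python sketch counts symbol occurrences in a romanized onomatopoeia. The romanization, the `SYMBOLS` inventory, and the function name `symbol_counts` are assumptions made for illustration only; the paper's actual phonetic symbol set and SVM configuration are not given in this listing.

```python
from collections import Counter

# Hypothetical symbol inventory: romanized letters stand in for phonetic symbols.
SYMBOLS = "abcdefghijklmnopqrstuvwxyz"

def symbol_counts(word):
    """Return a fixed-length vector of per-symbol occurrence counts."""
    counts = Counter(ch for ch in word.lower() if ch in SYMBOLS)
    return [counts[s] for s in SYMBOLS]

vec = symbol_counts("fuwafuwa")
# Show only the non-zero entries, for readability.
nonzero = {s: c for s, c in zip(SYMBOLS, vec) if c}
print(nonzero)  # {'a': 2, 'f': 2, 'u': 2, 'w': 2}
```

Vectors of this shape could then be passed to an SVM, for example with one binary classifier per sense, although that setup is also an assumption here.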

Analysis of User's Behavior in Information Retrieval Using Search Engine

Shogo KORI, Yu KATO, Yasufumi TAKAMA

Article type: SIG paper
2013 Volume 2013 Issue AM-04 Pages 02-
Published: July 29, 2013
Released on J-STAGE: August 28, 2021

DOI: https://doi.org/10.11517/jsaisigtwo.2013.AM-04_02

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

[in Japanese]

Log Data Analysis on Interactive Information Access

Tsuneaki KATO

Article type: SIG paper
2013 Volume 2013 Issue AM-04 Pages 03-
Published: July 29, 2013
Released on J-STAGE: August 28, 2021

DOI: https://doi.org/10.11517/jsaisigtwo.2013.AM-04_03

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

This paper reports the characteristics of user behavior in exploratory information access, which reflect differences in the environment the user works in and the task she engages in. Using a model of information access behavior and a log data coding scheme based on that model, we analyzed the log data obtained in VisEx, an experiment for evaluating interactive and exploratory information access environments. The analysis shows that the retrieval methods introduced, narrowing-down and similarity-based retrieval, are used as a substitute for sequential document checking, and that their effectiveness differs depending on task characteristics.

Latent Topic-based Graph Construction for Text Classification

Akiko ERIGUCHI, Ichiro KOBAYASHI

Article type: SIG paper
2013 Volume 2013 Issue AM-04 Pages 04-
Published: July 29, 2013
Released on J-STAGE: August 28, 2021

DOI: https://doi.org/10.11517/jsaisigtwo.2013.AM-04_04

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

This paper aims to raise the accuracy of multi-class text classification by means of graph-based semi-supervised learning (GBSSL). In GBSSL, it is essential to construct a proper graph expressing the relations among nodes. We propose a method that constructs a similarity graph by using both surface information and latent information to express the similarity between nodes. In experiments on the Reuters-21578 corpus, we confirmed that the proposed method raises the accuracy of GBSSL in a multi-class text classification task.
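
The general idea of combining surface and latent similarity in a graph can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' construction: the linear mixing weight `alpha`, the use of cosine similarity for both views, the clamped label propagation routine, and the toy vectors are all hypothetical.

```python
import math

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def combined_graph(surface, latent, alpha=0.5):
    """Edge weight = alpha * surface similarity + (1 - alpha) * latent similarity."""
    n = len(surface)
    return [[0.0 if i == j else
             alpha * cosine(surface[i], surface[j])
             + (1 - alpha) * cosine(latent[i], latent[j])
             for j in range(n)] for i in range(n)]

def propagate(weights, labels, n_classes, iters=50):
    """Iterative label propagation; labeled nodes are clamped each round."""
    n = len(weights)
    scores = [[1.0 if labels[i] == c else 0.0 for c in range(n_classes)]
              for i in range(n)]
    for _ in range(iters):
        new = []
        for i in range(n):
            if labels[i] is not None:
                new.append(scores[i])  # keep known labels fixed
                continue
            total = sum(weights[i]) or 1.0
            new.append([sum(weights[i][j] * scores[j][c] for j in range(n)) / total
                        for c in range(n_classes)])
        scores = new
    return [max(range(n_classes), key=lambda c: row[c]) for row in scores]

# Toy data (assumed): term-count vectors and topic distributions for 4 documents.
surface = [[2, 1, 0, 0], [1, 2, 0, 0], [0, 0, 1, 2], [0, 0, 2, 1]]
latent = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
labels = [0, None, 1, None]  # None marks an unlabeled document

pred = propagate(combined_graph(surface, latent), labels, n_classes=2)
print(pred)  # [0, 0, 1, 1]
```

In this toy case, each unlabeled document takes the class of the labeled document it resembles in both views.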

Validation on Efficient Text Classification Based on Latent Semantic with a Graph of Co-occurring Terms

Yukari OGURA, Ichiro KOBAYASHI

Article type: SIG paper
2013 Volume 2013 Issue AM-04 Pages 05-
Published: July 29, 2013
Released on J-STAGE: August 28, 2021

DOI: https://doi.org/10.11517/jsaisigtwo.2013.AM-04_05

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

We have previously proposed a method to raise the accuracy of text classification based on latent topic information. It introduces several techniques, such as extracting important words with the PageRank algorithm and reducing the size of target documents by replacing them with their own important sentences. In experiments on text classification with the Reuters-21578 data set, we confirmed that the proposed method raised classification accuracy. In this paper, we verify the method with additional experiments on the 20 Newsgroups data set and report the results.
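
Ranking words with PageRank over a graph of co-occurring terms can be sketched roughly as below. The sentence-level co-occurrence window, the damping factor of 0.85, the helper names, and the toy sentences are assumptions for illustration; the abstract does not specify how the authors build or score their graph.

```python
from collections import defaultdict
from itertools import combinations

def cooccurrence_graph(sentences):
    """Undirected graph linking words that appear in the same sentence."""
    graph = defaultdict(set)
    for sent in sentences:
        for a, b in combinations(sorted(set(sent)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph

def pagerank(graph, damping=0.85, iters=50):
    """Power-iteration PageRank on an undirected graph without isolated nodes."""
    nodes = list(graph)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iters):
        rank = {n: (1 - damping) / len(nodes)
                   + damping * sum(rank[m] / len(graph[m]) for m in graph[n])
                for n in nodes}
    return rank

# Toy tokenized sentences (assumed data).
sentences = [["oil", "price", "rise"],
             ["oil", "export"],
             ["oil", "market", "price"]]

rank = pagerank(cooccurrence_graph(sentences))
top_word = max(rank, key=rank.get)
print(top_word)  # oil
```

Words with the highest scores would be the candidates for "important words"; how many to keep is a separate design choice not covered here.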

Consideration of Design Guide for Constructing General Purpose System using TETDM

Tomoki KAJINAMI, Koichi TASHIRO, Takuma TONEGAWA, Yuuya KITAMURA, Yasu ...

Article type: SIG paper
2013 Volume 2013 Issue AM-04 Pages 06-
Published: July 29, 2013
Released on J-STAGE: August 28, 2021

DOI: https://doi.org/10.11517/jsaisigtwo.2013.AM-04_06

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

This paper considers a collaborative policy for combining tools when developing a system with TETDM. TETDM is a total environment for text data mining that can be prepared for various mining tasks by combining small mining tools. However, no useful guide has yet been considered for designing a system built from several small tools created by different tool developers. This paper describes a design guide that aligns the user's purpose with the system's specifications when constructing a general-purpose system, and shows a practical example.
