IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Special Section on Knowledge-Based Software Engineering
An Informative DOM Subtree Identification Method from Web Pages in Unfamiliar Web Sites
Masanobu TSURUTAHiroyuki SAKAIShigeru MASUYAMA
Author information
JOURNAL FREE ACCESS

2008 Volume E91.D Issue 4 Pages 986-989

Details
Abstract
We propose a method of informative DOM* subtree identification from a Web page in an unfamiliar Web site. Our method uses layout data of DOM nodes generated by a generic Web browser. The results show that our method outperforms a baseline method, and was able to identify informative DOM subtrees from Web pages robustly.
Content from these authors
© 2008 The Institute of Electronics, Information and Communication Engineers
Previous article Next article
feedback
Top