Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Acquiring Polar Sentences from HTML Documents
NOBUHIRO KAJI, MASARU KITSUREGAWA
JOURNAL FREE ACCESS

2008 Volume 15 Issue 3 Pages 77-90

Abstract
This paper presents a method of acquiring polar sentences from HTML documents. The basic idea is to exploit three lexico-syntactic patterns and two layout structures of HTML documents. The method requires only a small number of hand-crafted rules and can be implemented at low cost. In our experiment, the method was applied to one billion documents, and 650 thousand polar sentences were acquired.
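
To make the layout-structure idea concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not say which layout structures or cue expressions the method uses, so everything here is an assumption for illustration: the cue words in `POSITIVE_CUES` and `NEGATIVE_CUES`, the choice of heading tags, and the names `PolarListParser` and `extract_polar_sentences` are all hypothetical. The sketch treats list items that follow a heading such as "Pros" or "Cons" as candidate polar sentences.

```python
from html.parser import HTMLParser

# Hypothetical cue words. The abstract does not list the actual cues.
POSITIVE_CUES = {"pros", "advantages", "good points"}
NEGATIVE_CUES = {"cons", "disadvantages", "bad points"}


class PolarListParser(HTMLParser):
    """Collect <li> items that follow a heading containing a polarity cue."""

    # Assumed set of tags that can act as a heading for a list.
    HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6", "dt", "th"}

    def __init__(self):
        super().__init__()
        self.polarity = None     # polarity set by the most recent heading
        self.current = None      # "heading" or "item" while inside one
        self.buffer = []         # text fragments of the current element
        self.results = []        # list of (polarity, sentence) pairs

    def handle_starttag(self, tag, attrs):
        if tag in self.HEADING_TAGS:
            self.current, self.buffer = "heading", []
        elif tag == "li":
            self.current, self.buffer = "item", []

    def handle_data(self, data):
        if self.current is not None:
            self.buffer.append(data)

    def handle_endtag(self, tag):
        # Normalize whitespace in the collected text.
        text = " ".join("".join(self.buffer).split())
        if tag in self.HEADING_TAGS and self.current == "heading":
            key = text.lower().rstrip(":")
            if key in POSITIVE_CUES:
                self.polarity = "positive"
            elif key in NEGATIVE_CUES:
                self.polarity = "negative"
            else:
                self.polarity = None  # an unrelated heading resets the state
            self.current = None
        elif tag == "li" and self.current == "item":
            if self.polarity and text:
                self.results.append((self.polarity, text))
            self.current = None


def extract_polar_sentences(html):
    """Return (polarity, sentence) pairs found under cue headings."""
    parser = PolarListParser()
    parser.feed(html)
    parser.close()
    return parser.results


if __name__ == "__main__":
    page = (
        "<h3>Pros</h3><ul><li>The battery lasts long.</li></ul>"
        "<h3>Cons</h3><ul><li>The screen is dim.</li></ul>"
    )
    for polarity, sentence in extract_polar_sentences(page):
        print(polarity, sentence)
```

A rule of this kind needs only a short cue list, which is in the spirit of the small number of hand-crafted rules mentioned above. A real system would also need to handle nested lists, tables, and noisy markup.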
© The Association for Natural Language Processing