Micro-text classification between small and big data

Markus Christen; Thomas Niederberger; Thomas Ott; Suleiman Aryobsei; Reto Hofstetter

doi:10.1587/nolta.6.556

Special Section on Recent Progress in Nonlinear Theory and Its Applications

Keywords: micro-text, text enrichment, text classification, innovation management, social media platforms, semi-supervised learning

2015 Volume 6 Issue 4 Pages 556-569

Abstract

Micro-texts emerging from social media platforms have become an important source for research. Automated classification and interpretation of such micro-texts is challenging. The problem is exacerbated when the number of texts is at a medium level: too small for effective machine learning, but too large to be analyzed efficiently by humans alone. We present a semi-supervised learning system for micro-text classification that combines machine learning techniques with the unmatched human ability to make demanding, i.e., nonlinear, decisions based on sparse data. We compare our system with human performance and with a predefined optimal classifier using a validated benchmark data set.
