With building Web sites more, or with more PR systems being digitized ones are urged to know processing a diversity of OA data for e-contents. This series brief basic knowledge and practices in the course starting from data acquisition to data processing, that is, data acquisition, making texts of those data checking, characters, tagging and data processing. This article deals with the first part of chapter 4: Points attended in extracted text data, and clean up, that is, basic knowledge and tools for processing of character strings.
View full abstract