Implications of Punctuation Mark Normalization on Text Retrieval
Description:
This research investigated issues related to normalizing punctuation marks from a text retrieval perspective. A punctuation-centric approach was undertaken by exploring changes in meaning, whitespace, word retrievability, and other issues related to normalizing punctuation marks. To investigate punctuation normalization issues, various frequency counts of punctuation marks and punctuation patterns were conducted using text drawn from the Gutenberg Project archive and the Usenet Newsgrou…
Date:
August 2013
Creator:
Kim, Eungi
Partner:
UNT Libraries