Text Normalization Article Index for
Text
Website Links For
Text
 

Information About

Text Normalization




Examples of text normalization:

  • Unicode Normalization

  • converting all letters to lower or upper case

  • removing punctuation

  • removing letters with accent marks and other diacritics

  • expanding abbreviations


While this may be done manually, and usually is in the case of ad hoc and personal documents, many Programming Language s support mechanisms which enable text normalization.