Interface Normalizer

All Known Implementing Classes:
SimpleNormalizer

public interface Normalizer
Normalization transforms text into a canonical form by removing unwanted variations. Normalization may range from light textual cleanup, such as compressing whitespace, to more aggressive and knowledge-intensive forms, such as standardizing date formats or expanding abbreviations. The nature and extent of normalization, as well as whether it is best applied at the document, sentence, or token level, must be determined in the context of a specific application.
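As an illustration of the "light textual cleanup" end of that range, the following is a minimal sketch of an implementation that compresses whitespace. The class name WhitespaceNormalizer is hypothetical and is not the library's SimpleNormalizer, whose behavior is not described here. The interface is redeclared locally only so the example compiles on its own.

```java
import java.util.regex.Pattern;

// Local stand-in mirroring the documented interface, so the sketch is self-contained.
interface Normalizer {
    String normalize(String text);
}

// Hypothetical implementation (an assumption, not part of the library): light cleanup only.
class WhitespaceNormalizer implements Normalizer {
    private static final Pattern WHITESPACE_RUN = Pattern.compile("\\s+");

    @Override
    public String normalize(String text) {
        // Collapse each run of whitespace into a single space, then trim both ends.
        return WHITESPACE_RUN.matcher(text).replaceAll(" ").trim();
    }
}

class WhitespaceNormalizerDemo {
    public static void main(String[] args) {
        Normalizer normalizer = new WhitespaceNormalizer();
        System.out.println(normalizer.normalize("  Hello,\t\tworld \n")); // prints "Hello, world"
    }
}
```

Precompiling the pattern avoids recompiling the regular expression on every call, which may matter if the normalizer runs once per token.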
  • Method Summary

    Modifier and Type    Method                    Description
    String               normalize(String text)    Normalize the given string.
  • Method Details

    • normalize

      String normalize(String text)
      Normalize the given string.
      Parameters:
      text - the text to normalize.
      Returns:
      the normalized text.
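
Because normalize takes and returns a plain String, the same instance can be applied to a whole document, a sentence, or a single token; which level is right depends on the normalizer. The sketch below assumes a hypothetical abbreviation table and a hypothetical helper, normalizeTokens, to show a token-level normalizer that has no effect when handed a full sentence. The interface is redeclared locally so the example compiles on its own, and the lambda relies on the interface having a single abstract method, as documented above.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.stream.Collectors;

// Local stand-in mirroring the documented interface.
interface Normalizer {
    String normalize(String text);
}

class TokenLevelDemo {
    // Hypothetical abbreviation table, for illustration only.
    static final Map<String, String> ABBREVIATIONS =
            Map.of("Dr.", "Doctor", "approx.", "approximately");

    // Token-level normalizer: replaces the input only when it is exactly a known abbreviation.
    static final Normalizer EXPAND = token -> ABBREVIATIONS.getOrDefault(token, token);

    // Hypothetical helper: applies a normalizer to each whitespace-separated token.
    static String normalizeTokens(Normalizer normalizer, String sentence) {
        return Arrays.stream(sentence.trim().split("\\s+"))
                .map(normalizer::normalize)
                .collect(Collectors.joining(" "));
    }

    public static void main(String[] args) {
        String sentence = "Dr. Lee waited approx. ten minutes";
        // Sentence level: unchanged, because the whole sentence is not a table key.
        System.out.println(EXPAND.normalize(sentence));
        // Token level: prints "Doctor Lee waited approximately ten minutes"
        System.out.println(normalizeTokens(EXPAND, sentence));
    }
}
```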