Interface Normalizer

All Superinterfaces:
Function<String,String>, UnaryOperator<String>
All Known Implementing Classes:
SimpleNormalizer

public interface Normalizer extends UnaryOperator<String>
Normalization transforms text into a canonical form by removing unwanted variations. Normalization may range from light textual cleanup such as compressing whitespace to more aggressive and knowledge-intensive forms like standardizing date formats or expanding abbreviations. The nature and extent of normalization, as well as whether it is most appropriate to apply on the document, sentence, or token level, must be determined in the context of a specific application.
  • Method Details