Package smile.nlp.tokenizer
Class BreakIteratorTokenizer
java.lang.Object
smile.nlp.tokenizer.BreakIteratorTokenizer
A word tokenizer based on the java.text.BreakIterator, which supports
multiple natural languages (selected by locale setting).
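As a rough illustration of the idea described above, the sketch below walks the word boundaries reported by the JDK's java.text.BreakIterator for a chosen locale. It is not the Smile implementation: the class name BreakIteratorSketch and the method tokenize are hypothetical, and dropping whitespace-only segments is an assumption made for this example.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration class; not part of smile.nlp.tokenizer.
class BreakIteratorSketch {
    /** Splits text at the word boundaries BreakIterator reports for the locale. */
    static List<String> tokenize(String text, Locale locale) {
        BreakIterator boundary = BreakIterator.getWordInstance(locale);
        boundary.setText(text);

        List<String> tokens = new ArrayList<>();
        int start = boundary.first();
        // Each [start, end) pair is one segment: a word, punctuation, or whitespace.
        for (int end = boundary.next(); end != BreakIterator.DONE;
                start = end, end = boundary.next()) {
            String segment = text.substring(start, end);
            // Assumption for this sketch: whitespace-only segments are not tokens.
            if (!segment.trim().isEmpty()) {
                tokens.add(segment);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("Hello, world!", Locale.ENGLISH));
    }
}
```

Passing a different Locale to getWordInstance selects the boundary rules for that language, which is presumably what the locale-taking constructor of this class controls.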
Constructor Summary
Constructor                              Description
BreakIteratorTokenizer()                 Constructor for the default locale.
BreakIteratorTokenizer(Locale locale)    Constructor for the given locale.
Method Summary
Constructor Details
BreakIteratorTokenizer
public BreakIteratorTokenizer()
Constructor for the default locale.
BreakIteratorTokenizer
public BreakIteratorTokenizer(Locale locale)
Constructor for the given locale.
Parameters:
locale - the locale.
Method Details