Package com.gengoai.hermes.preprocessing
Class TextNormalizer
java.lang.Object
    com.gengoai.hermes.preprocessing.TextNormalizer
All Implemented Interfaces: Serializable
Direct Known Subclasses: DiacriticalMarkNormalizer, HtmlEntityNormalizer, TraditionalToSimplified, UnicodeNormalizer, WhitespaceNormalizer
public abstract class TextNormalizer extends Object implements Serializable
Defines a methodology for normalizing a string.
Author: David B. Bracewell
See Also: Serialized Form
Constructor Summary
TextNormalizer()
Method Summary
String apply(String input, Language inputLanguage)
    Performs a pre-processing operation on the input string in the given input language.
protected abstract String performNormalization(String input, Language language)
    Performs a pre-processing operation on the input string in the given input language.
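The relationship between the public apply method and the protected abstract performNormalization method can be illustrated with a minimal sketch. To keep it runnable without Hermes on the classpath, it declares hypothetical stand-in types (SketchLanguage, SketchTextNormalizer) that mirror only the two signatures listed above. The delegation from apply to performNormalization, the null guard, and the LowerCaseNormalizer subclass are all assumptions made for illustration, not confirmed Hermes behavior.

```java
import java.io.Serializable;
import java.util.Locale;

// Hypothetical stand-in for the Language type used by Hermes (assumption:
// here it only serves as a tag that selects language-specific rules).
enum SketchLanguage { ENGLISH, TURKISH }

// Hypothetical stand-in that mirrors the two documented method signatures.
abstract class SketchTextNormalizer implements Serializable {
    private static final long serialVersionUID = 1L;

    // Public entry point. Delegating to performNormalization is an assumption
    // suggested by the identical descriptions in the method summary.
    public String apply(String input, SketchLanguage inputLanguage) {
        if (input == null) {
            return null; // assumed null guard, not confirmed by the source
        }
        return performNormalization(input, inputLanguage);
    }

    // Subclasses supply the actual normalization logic.
    protected abstract String performNormalization(String input, SketchLanguage language);
}

// Hypothetical subclass: lower-cases text with language-appropriate casing rules.
class LowerCaseNormalizer extends SketchTextNormalizer {
    private static final long serialVersionUID = 1L;

    @Override
    protected String performNormalization(String input, SketchLanguage language) {
        // Turkish maps 'I' to dotless 'ı', so the locale matters here.
        Locale locale = (language == SketchLanguage.TURKISH)
                ? Locale.forLanguageTag("tr")
                : Locale.ROOT;
        return input.toLowerCase(locale);
    }
}
```

Under these assumptions, a caller would invoke new LowerCaseNormalizer().apply(text, SketchLanguage.ENGLISH) and never call performNormalization directly. A real subclass would extend com.gengoai.hermes.preprocessing.TextNormalizer and take the library's own Language type instead of the stand-ins.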