Package com.gengoai.hermes.preprocessing
Class Summary

Class                      Description
DiacriticalMarkNormalizer  Removes diacritics.
HtmlEntityNormalizer       Normalizes XML and HTML entities, such as &
TextNormalization          Takes care of normalizing text using a number of TextNormalizers.
TextNormalizer             Defines a methodology for normalizing a string.
TraditionalToSimplified    Preprocessor that converts traditional characters into simplified characters.
UnicodeNormalizer          Converts Unicode to canonical form and removes smart quotes.
WhitespaceNormalizer       Handles normalizing whitespace.
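
The summary describes text being passed through a number of normalizers in turn. The following is a minimal sketch of that idea using only the Java standard library; it does not use the Hermes API. The class name NormalizationSketch, its method names, and the choice of plain UnaryOperator<String> steps are all assumptions made for illustration, and the actual TextNormalizer contract may differ.

```java
import java.text.Normalizer;
import java.util.List;
import java.util.function.UnaryOperator;

// Hypothetical illustration only: none of these names come from Hermes.
public class NormalizationSketch {

    // Stand-in for a diacritic-removing step: decompose, then drop combining marks.
    static String stripDiacritics(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    // Stand-in for a Unicode step: compose to NFC and replace curly quotes with straight ones.
    static String canonicalize(String s) {
        String nfc = Normalizer.normalize(s, Normalizer.Form.NFC);
        return nfc.replace('\u201C', '"')
                  .replace('\u201D', '"')
                  .replace('\u2018', '\'')
                  .replace('\u2019', '\'');
    }

    // Stand-in for a whitespace step: trim the ends and collapse internal runs to one space.
    static String collapseWhitespace(String s) {
        return s.trim().replaceAll("\\s+", " ");
    }

    // Applies each step in order, feeding the output of one step into the next.
    static String normalize(String input, List<UnaryOperator<String>> steps) {
        String result = input;
        for (UnaryOperator<String> step : steps) {
            result = step.apply(result);
        }
        return result;
    }

    public static void main(String[] args) {
        List<UnaryOperator<String>> steps = List.of(
                NormalizationSketch::canonicalize,
                NormalizationSketch::stripDiacritics,
                NormalizationSketch::collapseWhitespace);
        System.out.println(normalize("  \u201CCaf\u00E9\u201D   cr\u00E8me  ", steps));
    }
}
```

The order of steps matters in a chain like this: the sketch canonicalizes first so that later steps see a consistent Unicode form, and collapses whitespace last so that any spacing left behind by earlier steps is cleaned up.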