Package com.gengoai.hermes.extraction
Interface Summary

Interface / Description

Extraction
    An extraction is the output generated by an Extractor.

Extractor
    Fundamental to text mining in Hermes is the concept of an Extractor and the Extraction it produces.

Class Summary

Class / Description

FeaturizingExtractor
    Combines an Extractor with an Apollo Featurizer, allowing the output of the extractor to be used directly as features for machine learning.

MultiPhaseExtractor
    A FeaturizingExtractor that breaks the extraction process into the following parts:
    1. Extracts annotations of the given types.
    2. Trims the extractions, if a trim method is defined.
    3. Filters the extractions, if a filter is defined.

MultiPhaseExtractor.MultiPhaseExtractorBuilder<T extends MultiPhaseExtractor,V extends MultiPhaseExtractor.MultiPhaseExtractorBuilder<T,V>>

NGramExtractor
    A MultiPhaseExtractor implementation that extracts n-grams over the desired annotation types.

NGramExtractor.Builder
    Builder class for constructing an NGramExtractor.

RegexExtractor
    An Extractor implementation that searches for a given regular expression pattern in the document.

SearchExtractor
    An Extractor implementation that searches for a given search text in the document.

TermExtractor
    Implementation of the MultiPhaseExtractor for extracting terms, where a term is a single annotation (TOKEN by default).

TermExtractor.Builder
    Builder class for constructing a TermExtractor.
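
The extract, trim, and filter phases described for MultiPhaseExtractor, combined with n-gram generation as in NGramExtractor, can be illustrated with a small standalone sketch. This is not the Hermes API: the class MultiPhaseSketch, its methods, and the whitespace tokenizer are hypothetical stand-ins that use only the Java standard library, and the trim and filter predicates are assumptions about how such hooks might look.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical illustration only; none of these names come from Hermes.
class MultiPhaseSketch {

    // Stand-in for annotation extraction: split the text on whitespace.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.trim().split("\\s+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Phase 1: extract every n-gram whose length is between minOrder and maxOrder.
    static List<List<String>> ngrams(List<String> tokens, int minOrder, int maxOrder) {
        List<List<String>> out = new ArrayList<>();
        for (int start = 0; start < tokens.size(); start++) {
            for (int n = minOrder; n <= maxOrder && start + n <= tokens.size(); n++) {
                out.add(new ArrayList<>(tokens.subList(start, start + n)));
            }
        }
        return out;
    }

    // Phase 2: trim leading and trailing tokens that match the predicate.
    static List<String> trim(List<String> gram, Predicate<String> toTrim) {
        int from = 0;
        int to = gram.size();
        while (from < to && toTrim.test(gram.get(from))) {
            from++;
        }
        while (to > from && toTrim.test(gram.get(to - 1))) {
            to--;
        }
        return new ArrayList<>(gram.subList(from, to));
    }

    // Runs all phases; toTrim and keep may be null, in which case that phase is skipped.
    static List<String> extract(String text, int minOrder, int maxOrder,
                                Predicate<String> toTrim, Predicate<List<String>> keep) {
        List<String> result = new ArrayList<>();
        for (List<String> gram : ngrams(tokenize(text), minOrder, maxOrder)) {
            List<String> trimmed = (toTrim == null) ? gram : trim(gram, toTrim);
            if (trimmed.isEmpty()) {
                continue; // nothing left after trimming
            }
            // Phase 3: filter, applied only when a filter is supplied.
            if (keep != null && !keep.test(trimmed)) {
                continue;
            }
            result.add(String.join(" ", trimmed));
        }
        return result;
    }

    public static void main(String[] args) {
        // Unigrams and bigrams, trimming the word "the", with no filter.
        System.out.println(extract("the quick fox", 1, 2, t -> t.equals("the"), null));
        // prints [quick, quick, quick fox, fox]
    }
}
```

Note that trimming can produce duplicates ("the quick" and "quick" both yield "quick" here); whether a real extractor deduplicates or counts such results is a design choice this sketch leaves open.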