Uses of Interface
com.gengoai.hermes.extraction.Extractor
-
-
Uses of Extractor in com.gengoai.hermes.corpus
Methods in com.gengoai.hermes.corpus with parameters of type Extractor Modifier and Type Method Description default Counter<String>
DocumentCollection. documentCount(@NonNull Extractor extractor)
Calculates the document frequency of annotations of the given annotation type in the corpus.default Counter<String>
DocumentCollection. termCount(@NonNull Extractor extractor)
Calculates the total corpus frequency of terms extracted using the given extractor. -
Uses of Extractor in com.gengoai.hermes.extraction
Classes in com.gengoai.hermes.extraction that implement Extractor Modifier and Type Class Description class
FeaturizingExtractor
Combines anExtractor
with an ApolloFeaturizer
allowing for the output of the extractor to be directly used as features for machine learning.class
MultiPhaseExtractor
AFeaturizingExtractor
that breaks the extraction process into the follow parts: Extracts annotations of the given types. Trims the extractions, if a trim method is defined. Filters the extractions, if a trim method is defined.class
NGramExtractor
AMultiPhaseExtractor
implementation that extracts n-grams over the desired annotation types.class
RegexExtractor
An Extractor implementation that searches for a given regular expression pattern in the document.class
SearchExtractor
An Extractor implementation that searches for a given search text in the document.class
TermExtractor
Implementation of theMultiPhaseExtractor
for extracting terms where a term is a single annotation (TOKEN by default). -
Uses of Extractor in com.gengoai.hermes.extraction.caduceus
Classes in com.gengoai.hermes.extraction.caduceus that implement Extractor Modifier and Type Class Description class
CaduceusProgram
Caduceus, pronounced ca·du·ceus, is a rule-based information extraction system. -
Uses of Extractor in com.gengoai.hermes.extraction.keyword
Subinterfaces of Extractor in com.gengoai.hermes.extraction.keyword Modifier and Type Interface Description interface
KeywordExtractor
A keyword extractor determines the important words, phrases, or concepts inHString
returning a counter of keywords and their corresponding scores.Classes in com.gengoai.hermes.extraction.keyword that implement Extractor Modifier and Type Class Description class
NPClusteringKeywordExtractor
Implementation of the NP Clustering Keyword Extractor presented in:class
RakeKeywordExtractor
Implementation of the RAKE keyword extraction algorithm as presented in:class
TermKeywordExtractor
Implementation of aKeywordExtractor
that extracts and scores terms based on a givenFeaturizingExtractor
*.class
TextRank
Implementation of the TextRank algorithm for keyword extraction as defined in: Mihalcea, R., Tarau, P.: "Textrank: Bringing order into texts".class
TFIDFKeywordExtractor
Keyword extractor that scores words based on their TFIDF value. -
Uses of Extractor in com.gengoai.hermes.extraction.lyre
Classes in com.gengoai.hermes.extraction.lyre that implement Extractor Modifier and Type Class Description class
LyreExpression
A LyreExpression represents a series of steps to perform over an inputHString
which can be used for querying (i.e. -
Uses of Extractor in com.gengoai.hermes.extraction.regex
Classes in com.gengoai.hermes.extraction.regex that implement Extractor Modifier and Type Class Description class
TokenRegex
Hermes provides a token-based regular expression engine that allows for matches on arbitrary annotation types, relation types, and attributes, while providing many of the operators that are possible using standard Java regular expressions. -
Uses of Extractor in com.gengoai.hermes.extraction.summarization
Subinterfaces of Extractor in com.gengoai.hermes.extraction.summarization Modifier and Type Interface Description interface
Summarizer
Classes in com.gengoai.hermes.extraction.summarization that implement Extractor Modifier and Type Class Description class
TextRankSummarizer
Implementation of the TextRank algorithm for summarization as defined in: Mihalcea, R., Tarau, P.: "Textrank: Bringing order into texts". -
Uses of Extractor in com.gengoai.hermes.lexicon
Classes in com.gengoai.hermes.lexicon that implement Extractor Modifier and Type Class Description class
DiskLexicon
APersistentLexicon
that storesLexiconEntry
on disk facilitating the use of very lexicons with little memory overhead.class
Lexicon
A traditional approach to information extraction incorporates the use of lexicons, also called gazetteers, for finding specific lexical items in text.class
PersistentLexicon
Base class for lexicon implementations that are persistent, meaning added entries are persisted between runs.class
TrieLexicon
Implementation ofLexicon
usng a Trie data structure. -
Uses of Extractor in com.gengoai.hermes.similarity
Constructors in com.gengoai.hermes.similarity with parameters of type Extractor Constructor Description ExtractorBasedSimilarity(@NonNull Similarity measure, @NonNull Extractor termExtractor)
Instantiates a new TokenSimilarity.
-