Uses of Interface
com.gengoai.hermes.Document
-
Packages that use Document Package Description com.gengoai.hermes Hermes is a framework for Natural Language Processing.com.gengoai.hermes.annotator com.gengoai.hermes.corpus com.gengoai.hermes.extraction.caduceus com.gengoai.hermes.extraction.regex com.gengoai.hermes.format com.gengoai.hermes.format.conll -
-
Uses of Document in com.gengoai.hermes
Methods in com.gengoai.hermes that return Document Modifier and Type Method Description static Document
Document. create(@NonNull String text)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String text, @NonNull Language language)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String text, @NonNull Language language, @NonNull Map<AttributeType<?>,?> attributes)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String id, @NonNull String text)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String id, @NonNull String text, @NonNull Language language)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String id, @NonNull String text, @NonNull Language language, @NonNull Map<AttributeType<?>,?> attributes)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String id, @NonNull String text, @NonNull Map<AttributeType<?>,?> attributes)
Convenience method for creating a document using the default document factory.static Document
Document. create(@NonNull String text, @NonNull Map<AttributeType<?>,?> attributes)
Convenience method for creating a document using the default document factory.Document
DocumentFactory. create(@NonNull String content)
Creates a document with the given content assigning the new document an auto-generated id and setting its language to the default language.Document
DocumentFactory. create(@NonNull String content, @NonNull Language language)
Creates a document with the given content written in the given language assigning the new document an auto-generated id .Document
DocumentFactory. create(@NonNull String content, @NonNull Language language, @NonNull Map<AttributeType<?>,?> attributeMap)
Creates a document with the given content written in the given language having the given set of attributes.Document
DocumentFactory. create(@NonNull String id, @NonNull String content)
Creates a document with the given id and content setting its language to the default language.Document
DocumentFactory. create(@NonNull String id, @NonNull String content, @NonNull Language language)
Creates a document with the given id and content written in the given language.Document
DocumentFactory. create(@NonNull String id, @NonNull String content, @NonNull Language language, @NonNull Map<AttributeType<?>,?> attributeMap)
Creates a document with the given id and content written in the given language having the given set of attributes.Document
DocumentFactory. createRaw(@NonNull String content)
Creates a document with the given content written in the default language.Document
DocumentFactory. createRaw(@NonNull String content, @NonNull Language language)
Creates a document with the given content written in the given language.Document
DocumentFactory. createRaw(@NonNull String content, @NonNull Language language, @NonNull Map<AttributeType<?>,?> attributeMap)
Creates a document with the given content written in the given language having the given set of attributes.Document
DocumentFactory. createRaw(@NonNull String id, @NonNull String content)
Creates a document with the given id and content written in the default language.Document
DocumentFactory. createRaw(@NonNull String id, @NonNull String content, @NonNull Language language)
Creates a document with the given id and content written in the given language.Document
DocumentFactory. createRaw(@NonNull String id, @NonNull String content, @NonNull Language language, @NonNull Map<AttributeType<?>,?> attributeMap)
Creates a document with the given id and content written in the given language having the given set of attributes.default Document
Document. document()
Document
HString. document()
static Document
Document. fromJson(@NonNull String jsonString)
Creates a document from a JSON representation (created by the write or toJson methods)Document
DocumentFactory. fromTokens(@NonNull Language language, @NonNull String... tokens)
Creates a document from the given tokens.Document
DocumentFactory. fromTokens(@NonNull Iterable<String> tokens)
Creates a document from the given tokens using the default language.Document
DocumentFactory. fromTokens(@NonNull Iterable<String> tokens, @NonNull Language language)
Creates a document from the given tokens.Document
DocumentFactory. fromTokens(@NonNull String... tokens)
Creates a document from the given tokens using the default language.default Document
HString. toDocument()
Converts this HString into a Document copying the annotations and relations.Methods in com.gengoai.hermes with parameters of type Document Modifier and Type Method Description boolean
AnnotationPipeline. annotate(@NonNull Document document)
Annotates a document to fulfill the desired set of annotatable typesstatic HString
Fragments. emptyHString(Document document)
Creates an empty HString associated with the given document.static HString
Fragments. span(Document document, int start, int end)
Creates an HString which is a span, i.e. -
Uses of Document in com.gengoai.hermes.annotator
Methods in com.gengoai.hermes.annotator with parameters of type Document Modifier and Type Method Description void
Annotator. annotate(@NonNull Document document)
Annotates a document with one or more AnnotatableType defined insatisfies()
.protected abstract void
Annotator. annotateImpl(Document document)
Annotates a document with one or more AnnotatableType defined insatisfies()
.protected void
DefaultCategoryAnnotator. annotateImpl(Document document)
protected void
DefaultLemmaAnnotator. annotateImpl(Document document)
protected void
DefaultMlEntityAnnotator. annotateImpl(Document document)
protected void
DefaultSentenceAnnotator. annotateImpl(Document doc)
protected void
DefaultStemAnnotator. annotateImpl(Document document)
protected void
DefaultTokenAnnotator. annotateImpl(Document document)
protected void
DefaultTransliterationAnnotator. annotateImpl(Document document)
protected void
RegexAnnotator. annotateImpl(@NonNull Document document)
protected void
SentenceLevelAnnotator. annotateImpl(Document document)
protected void
SubTypeAnnotator. annotateImpl(Document document)
protected void
FuzzyLexiconAnnotator. createAndAttachAnnotation(Document document, LexiconMatch match)
protected abstract void
ViterbiAnnotator. createAndAttachAnnotation(Document document, LexiconMatch span)
Given an possible span determines if an annotation should be created and if so creates and attaches it. -
Uses of Document in com.gengoai.hermes.corpus
Methods in com.gengoai.hermes.corpus that return Document Modifier and Type Method Description default Document
Corpus. getDocument(String id)
Gets the document with the given document idMethods in com.gengoai.hermes.corpus that return types with arguments of type Document Modifier and Type Method Description default <K> Multimap<K,Document>
DocumentCollection. groupBy(@NonNull SerializableFunction<? super Document,K> keyFunction)
Groups documents in the document store using the given function.default Iterator<Document>
DocumentCollection. iterator()
MStream<Document>
DocumentCollection. parallelStream()
Gets a parallel stream over the documents in the collectionMStream<Document>
DocumentCollection. stream()
Gets a stream over the documents in the collectionMethods in com.gengoai.hermes.corpus with parameters of type Document Modifier and Type Method Description boolean
Corpus. add(Document document)
Adds a document to the corpusstatic DocumentCollection
DocumentCollection. create(@NonNull Document... documents)
Creates a document collection for one or more documents.abstract boolean
Query. matches(Document document)
Tests if the document matches the queryboolean
Corpus. remove(Document document)
Removes a document from the corpusboolean
Corpus. update(Document document)
Updates the given documentMethod parameters in com.gengoai.hermes.corpus with type arguments of type Document Modifier and Type Method Description default void
Corpus. addAll(@NonNull Iterable<Document> documents)
Adds multiple documents to the corpus.static DocumentCollection
DocumentCollection. create(@NonNull MStream<Document> documents)
Creates a document collection for a stream of documents.static DocumentCollection
DocumentCollection. create(@NonNull Iterable<Document> documents)
Creates a document collection for one or more documents.static DocumentCollection
DocumentCollection. create(@NonNull Stream<Document> documents)
Creates a document collection for a stream of documents.default DocumentCollection
DocumentCollection. filter(@NonNull SerializablePredicate<Document> predicate)
Filters the documents in the collection using the given predicatedefault <K> Multimap<K,Document>
DocumentCollection. groupBy(@NonNull SerializableFunction<? super Document,K> keyFunction)
Groups documents in the document store using the given function.Corpus
Corpus. update(@NonNull String operation, @NonNull SerializableConsumer<Document> documentProcessor)
DocumentCollection
DocumentCollection. update(String operationName, @NonNull SerializableConsumer<Document> documentProcessor)
Updates all documents in the corpus using the given document processor -
Uses of Document in com.gengoai.hermes.extraction.caduceus
Methods in com.gengoai.hermes.extraction.caduceus with parameters of type Document Modifier and Type Method Description void
CaduceusProgram. execute(@NonNull Document document)
Executes the program over the given document. -
Uses of Document in com.gengoai.hermes.extraction.regex
Methods in com.gengoai.hermes.extraction.regex that return Document Modifier and Type Method Description Document
TokenMatch. getDocument()
-
Uses of Document in com.gengoai.hermes.format
Methods in com.gengoai.hermes.format that return types with arguments of type Document Modifier and Type Method Description MStream<Document>
DocFormat. read(Resource inputResource)
Reads documents in this format from the given input resource.MStream<Document>
WholeFileTextFormat. read(Resource inputResource)
protected Stream<Document>
CoNLLFormat. readSingleFile(String file)
protected Stream<Document>
CsvFormat. readSingleFile(String file)
protected Stream<Document>
HermesJsonFormat. readSingleFile(String content)
protected Stream<Document>
PennTreebankFormat. readSingleFile(String content)
protected Stream<Document>
POSFormat. readSingleFile(String content)
protected Stream<Document>
TaggedFormat. readSingleFile(String content)
protected Stream<Document>
TwitterSearchFormat. readSingleFile(String json)
protected Stream<Document>
TxtFormat. readSingleFile(String content)
protected abstract Stream<Document>
WholeFileTextFormat. readSingleFile(String content)
Converts the content of an entire file into one ore more documents.Methods in com.gengoai.hermes.format with parameters of type Document Modifier and Type Method Description void
CoNLLColumnProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
Processes a set of CoNLL rows making up a documentvoid
CoNLLFormat. write(Document document, Resource outputResource)
void
CsvFormat. write(Document document, Resource outputResource)
void
DocFormat. write(Document document, Resource outputResource)
Writes the given document in this format to the given output resource.void
HermesJsonFormat. write(Document document, Resource outputResource)
void
PennTreebankFormat. write(Document document, Resource outputResource)
void
POSFormat. write(Document document, Resource outputResource)
void
TaggedFormat. write(Document document, Resource outputResource)
void
TwitterSearchFormat. write(Document document, Resource outputResource)
void
TxtFormat. write(Document document, Resource outputResource)
-
Uses of Document in com.gengoai.hermes.format.conll
Methods in com.gengoai.hermes.format.conll with parameters of type Document Modifier and Type Method Description void
CategoryProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
DependencyLinkProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
DependencyRelationProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
IndexProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
IOBFieldProcessor. processInput(Document document, List<CoNLLRow> rows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
LemmaProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
MorphologicalFeatureProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
NoOptProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
POSFieldProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
UPOSProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
void
WordProcessor. processInput(Document document, List<CoNLLRow> documentRows, Map<Tuple2<Integer,Integer>,Long> sentenceIndexToAnnotationId)
-