Package com.gengoai.hermes.format
Interface OneDocPerFileFormat
All Superinterfaces:
DocFormat, Serializable
All Known Implementing Classes:
CoNLLFormat, HermesJsonFormat, PennTreebankFormat, POSFormat, TaggedFormat, TwitterSearchFormat, TxtFormat
public interface OneDocPerFileFormat extends DocFormat, Serializable
Defines a format in which only one document is written per file. These formats require the output resource to be a directory and create individual files named "part-#####", where "#####" is a counter that ranges from 0 to the number of documents in the corpus.
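The naming scheme described above can be illustrated with a minimal sketch that uses only the JDK, not Hermes itself. The class PartFileSketch and its methods partName and writeParts are hypothetical, plain strings stand in for documents, and the five-digit zero padding is an assumption read off the "part-#####" pattern rather than something confirmed by the library.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

public class PartFileSketch {

    // Hypothetical helper: builds the file name for the document at the given index.
    // Assumption: "#####" means a five-digit, zero-padded counter.
    static String partName(int index) {
        return String.format("part-%05d", index);
    }

    // Writes each document to its own file inside outputDir.
    // The directory is created if it does not exist yet.
    static void writeParts(List<String> documents, Path outputDir) throws IOException {
        Files.createDirectories(outputDir);
        for (int i = 0; i < documents.size(); i++) {
            Path target = outputDir.resolve(partName(i));
            Files.writeString(target, documents.get(i), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("corpus-out");
        writeParts(List.of("first document", "second document"), dir);
        // List the generated file names in sorted order.
        try (var names = Files.list(dir)) {
            names.map(p -> p.getFileName().toString())
                 .sorted()
                 .forEach(System.out::println);
        }
    }
}
```

Running main on a two-document list produces the files part-00000 and part-00001 in a temporary directory. An implementing class in Hermes would serialize each document in its own format instead of writing a raw string.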
Method Summary
Modifier and Type: default void
Method: write(DocumentCollection corpus, Resource outputResource)
Description: Writes a corpus of documents in this format to the given output resource.
Methods inherited from interface com.gengoai.hermes.format.DocFormat
getParameters, read, write
Method Detail
write
default void write(DocumentCollection corpus, Resource outputResource) throws IOException
Description copied from interface: DocFormat
Writes a corpus of documents in this format to the given output resource.
Specified by:
write in interface DocFormat
Parameters:
corpus - the corpus
outputResource - the output resource
Throws:
IOException - Something went wrong writing the corpus