Interface PipelineDescriptor
- All Known Implementing Classes:
ConfigurablePipelineDescriptor
,NoOpPipelineDescriptor
public interface PipelineDescriptor
A descriptor which can provide a
Pipeline
with newPipeline
method.-
Method Summary
Modifier and TypeMethodDescriptionGets the default extension of the file result(s)
The default extension is used whengetResultSubfolder()
is a folder and not a file.Gets theextraction matcher
to check if the pipeline is able to handle a given extractiongetLabel()
Gets the labelGet the output parameters to add some specific features in each serializer.Gets the result subfolder (or file).Gets theserializer model
Gets the XSLT file names to chainnewPipeline
(OutputStream out) Provides a newPipeline
-
Method Details
-
getLabel
Gets the label- Returns:
- the label
-
getExtractionMatcher
Gets theextraction matcher
to check if the pipeline is able to handle a given extraction- Returns:
- the
extraction matcher
of the pipeline
-
getStylesheets
Gets the XSLT file names to chain- Returns:
- the XSLT file names to chain
-
getSerializerModel
Gets theserializer model
- Returns:
- the
serializer model
-
getResultSubfolder
Gets the result subfolder (or file).
Can contain variables (seePathResolver
), resolved against the extraction results.
If it returns- a file path without variable, then all extraction results will be in this file;
- a file path with variables, then extraction results will be dispatched in the resolved file paths;
-
a folder path without variable, then all extraction results will be in one file under this folder
(the file name will be the one provided in
defaultResultFileName
parameter ofExtractionExecutor#execute method
); -
a folder path with variables, then each first level content extraction result will be dispatched in one file under the resolved folders
(the file name will be the title of the content, and the extension is the return of
getDefaultExtension()
);
- Returns:
- the result subfolder (or file)
-
getDefaultExtension
Gets the default extension of the file result(s)
The default extension is used whengetResultSubfolder()
is a folder and not a file.- Returns:
- the default extension of the file result(s)
-
newPipeline
Provides a newPipeline
- Parameters:
out
- the output stream to write on- Returns:
- the built
Pipeline
-
getOutputParameters
Get the output parameters to add some specific features in each serializer.- Returns:
- The output parameters
-