public interface TwoStepsExecutingExtractionComponent extends ExtractionComponent
ExtractionComponent
whose execution sometimes needs to be done in two times
(in order to avoid calling execute
n times for performance reasons):
computeFirstLevelResults(ExtractionExecutionContext)
for returning all the first level content results;executeFor(ContentHandler, Iterable, ExtractionExecutionContext)
for SAXing with the given ContentHandler
and the given iterable on contents, subset of the one returned by the first method.execute
n times,
computeFirstLevelResults
is called one time
and executeFor
is called n times with a different handler and iterable.
execute
is equivalent to successively call computeFirstLevelResults
and then executeFor
with that result.Modifier and Type | Method and Description |
---|---|
Iterable<Content> |
computeFirstLevelResults(ExtractionExecutionContext context)
Computes first level results for preparing multiple extraction executions
|
default void |
execute(ContentHandler contentHandler,
ExtractionExecutionContext context)
Execute the extraction of the component
|
void |
executeFor(ContentHandler contentHandler,
Iterable<Content> subsetResults,
ExtractionExecutionContext context)
Execute the extraction of the component for the given first level results
|
addSubComponent, getComponentDetailsForTree, getContentTypes, getSubComponents, prepareComponentExecution
Iterable<Content> computeFirstLevelResults(ExtractionExecutionContext context) throws Exception
context
- context of the extraction componentException
- if an error occursvoid executeFor(ContentHandler contentHandler, Iterable<Content> subsetResults, ExtractionExecutionContext context) throws Exception
contentHandler
- result documentsubsetResults
- A subset of the first level content results previously returned by computeFirstLevelResults(ExtractionExecutionContext)
context
- context of the extraction componentException
- if an error occursdefault void execute(ContentHandler contentHandler, ExtractionExecutionContext context) throws Exception
ExtractionComponent
execute
in interface ExtractionComponent
contentHandler
- result documentcontext
- context of the extraction componentException
- if an error occurs