public abstract class EntityBasedMicroformatExtractor extends MicroformatExtractor
Inherited nested classes/interfaces: Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor

Inherited fields: BEGIN_SCRIPT, END_SCRIPT, valueFactory

| Constructor and Description |
|---|
| EntityBasedMicroformatExtractor() |
| Modifier and Type | Method and Description |
|---|---|
| boolean | extract() - Performs the extraction of the data and writes it to the model. |
| protected abstract boolean | extractEntity(Node node, ExtractionResult out) - Extracts an entity from a DOM node. |
| protected abstract String | getBaseClassName() - Returns the base class name for the extractor. |
| protected org.openrdf.model.BNode | getBlankNodeFor(Node node) |
| protected abstract void | resetExtractor() - Resets the internal status of the extractor to prepare it for a new extraction session. |
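
The relationship between extract() and the three abstract methods can be illustrated with a minimal, self-contained sketch. It uses hypothetical stand-in types (SketchNode, SketchEntityExtractor, SketchCardExtractor, and a plain List<String> in place of ExtractionResult), not the real library classes. The control flow inside extract() is an assumption made for illustration: visit each node carrying the base class name, reset the extractor, then delegate to extractEntity. The class name "vcard" is only an example value.

```java
import java.util.List;

// Hypothetical stand-in for a DOM node: a class attribute plus text content.
final class SketchNode {
    final String cssClass;
    final String text;

    SketchNode(String cssClass, String text) {
        this.cssClass = cssClass;
        this.text = text;
    }
}

// Hypothetical stand-in for the abstract base class. This is NOT the
// library implementation; it only mirrors the documented method names.
abstract class SketchEntityExtractor {
    private final List<SketchNode> document;

    SketchEntityExtractor(List<SketchNode> document) {
        this.document = document;
    }

    protected abstract String getBaseClassName();

    protected abstract void resetExtractor();

    protected abstract boolean extractEntity(SketchNode node, List<String> out);

    // Assumed flow: for every node whose class matches the base class name,
    // reset per-entity state and delegate to extractEntity.
    public boolean extract(List<String> out) {
        boolean foundAny = false;
        for (SketchNode node : document) {
            if (!getBaseClassName().equals(node.cssClass)) {
                continue; // not an entity root for this extractor
            }
            resetExtractor();
            foundAny |= extractEntity(node, out);
        }
        return foundAny;
    }
}

// Hypothetical concrete extractor for nodes whose class is "vcard".
final class SketchCardExtractor extends SketchEntityExtractor {
    int resets = 0; // counts resetExtractor() calls, for demonstration only

    SketchCardExtractor(List<SketchNode> document) {
        super(document);
    }

    @Override
    protected String getBaseClassName() {
        return "vcard";
    }

    @Override
    protected void resetExtractor() {
        resets++;
    }

    // Returns true only if something was written to the collector.
    @Override
    protected boolean extractEntity(SketchNode node, List<String> out) {
        if (node.text.isEmpty()) {
            return false;
        }
        out.add("name=" + node.text);
        return true;
    }
}
```

In this sketch a subclass only declares which class name marks an entity root and how one node becomes output; the loop, the per-entity reset, and the aggregation of the boolean results stay in the base class.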
Inherited methods: addBNodeProperty, addBNodeProperty, addURIProperty, conditionallyAddLiteralProperty, conditionallyAddResourceProperty, conditionallyAddStringProperty, fixLink, fixLink, getCurrentExtractionResult, getDescription, getDocumentURI, getExtractionContext, getHTMLDocument, includes, openSubResult, run

protected abstract String getBaseClassName()
protected abstract void resetExtractor()
protected abstract boolean extractEntity(Node node, ExtractionResult out) throws ExtractionException
Parameters:
node - the DOM node.
out - the extraction result collector.
Returns:
true if the extraction has produced something, false otherwise.
Throws:
ExtractionException

public boolean extract()
throws ExtractionException
Description copied from class: MicroformatExtractor
Specified by:
extract in class MicroformatExtractor
Throws:
ExtractionException

protected org.openrdf.model.BNode getBlankNodeFor(Node node)
Parameters:
node - a DOM node representing a blank node

Copyright © 2010-2013 The Apache Software Foundation. All Rights Reserved.