Extractor needed to distill RDF
from Microformats in HTML pages are contained in this package.See: Description
| Class | Description |
|---|---|
| AdrExtractor |
Extractor for the adr
microformat.
|
| AdrExtractorFactory | |
| DocumentReport |
Represents the validationReportBuilder generated by a
the
TagSoupParser when a document
is retrieved and validated. |
| DomUtils |
This class provides utility methods for DOM manipulation.
|
| EntityBasedMicroformatExtractor |
Base class for microformat extractors based on entities.
|
| GeoExtractor |
Extractor for the Geo
microformat.
|
| GeoExtractorFactory | |
| HCalendarExtractor |
Extractor for the hCalendar
microformat.
|
| HCalendarExtractorFactory | |
| HCardExtractor |
Extractor for the hCard
microformat.
|
| HCardExtractorFactory | |
| HCardName |
An HCard name, consisting of various parts.
|
| HeadLinkExtractor |
This
Extractor.TagSoupDOMExtractor implementation
retrieves the LINKs declared within the HTML/HEAD page header. |
| HeadLinkExtractorFactory | |
| HListingExtractor |
Extractor for the hListing
microformat.
|
| HListingExtractorFactory | |
| HRecipeExtractor |
Extractor for the hRecipe
microformat.
|
| HRecipeExtractorFactory | |
| HResumeExtractor |
Extractor for the hResume
microformat.
|
| HResumeExtractorFactory | |
| HReviewAggregateExtractor |
Extractor for the hReview-aggregate
microformat.
|
| HReviewAggregateExtractorFactory | |
| HReviewExtractor |
Extractor for the hReview
microformat.
|
| HReviewExtractorFactory | |
| HTMLDocument |
A wrapper around the DOM representation of an HTML document.
|
| HTMLDocument.TextField |
This class represents a text extracted from the HTML DOM related
to the node from which such test has been retrieved.
|
| HTMLMetaExtractor |
This extractor represents the HTML META tag values
according the HTML4 specification.
|
| HTMLMetaExtractorFactory | |
| ICBMExtractor |
Extractor for "ICBM coordinates" provided as META headers in the head
of an HTML page.
|
| ICBMExtractorFactory | |
| LicenseExtractor |
Extractor for the rel-license
microformat.
|
| LicenseExtractorFactory | |
| MicroformatExtractor |
The abstract base class for any
Microformat specification extractor.
|
| SpanCloserInputStream |
Extension of
InputStream meant to
detect and replace any occurrence of inline span: |
| SpeciesExtractor |
Extractor able to extract the Species Microformat.
|
| SpeciesExtractorFactory | |
| TagSoupParser |
Parses an
InputStream
into an |
| TagSoupParser.ElementLocation |
Describes a DOM Element location.
|
| TitleExtractor |
Extracts the value of the <title> element of an
HTML or XHTML page.
|
| TitleExtractorFactory | |
| TurtleHTMLExtractor |
Extractor for Turtle/N3 format embedded within HTML
script tags.
|
| TurtleHTMLExtractorFactory | |
| XFNExtractor |
Extractor for the XFN
microformat.
|
| XFNExtractorFactory |
Extractor needed to distill RDF
from Microformats in HTML pages are contained in this package.Copyright © 2010-2013 The Apache Software Foundation. All Rights Reserved.