Interface OOXMLExtractor
- All Known Implementing Classes:
AbstractOOXMLExtractor, POIXMLTextExtractorDecorator, SXSLFPowerPointExtractorDecorator, SXWPFWordExtractorDecorator, XPSExtractorDecorator, XSLFPowerPointExtractorDecorator, XSSFBExcelExtractorDecorator, XSSFExcelExtractorDecorator, XWPFWordExtractorDecorator
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
Interface implemented by all Tika OOXML extractors.
Method Summary
Modifier and Type | Method | Description
POIXMLDocument | getDocument() | Deprecated. Returns the opened document.
MetadataExtractor | getMetadataExtractor() | Deprecated. POIXMLTextExtractor.getMetadataTextExtractor() not yet supported for OOXML by POI.
void | getXHTML(ContentHandler handler, Metadata metadata, ParseContext context) | Deprecated. Parses the document into a sequence of XHTML SAX events sent to the given content handler.
Method Details
getDocument
POIXMLDocument getDocument()
Deprecated.
Returns the opened document.
getMetadataExtractor
MetadataExtractor getMetadataExtractor()
Deprecated.
POIXMLTextExtractor.getMetadataTextExtractor() not yet supported for OOXML by POI.
getXHTML
void getXHTML(ContentHandler handler, Metadata metadata, ParseContext context) throws SAXException, XmlException, IOException, TikaException
Deprecated.
Parses the document into a sequence of XHTML SAX events sent to the given content handler.
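To illustrate what "a sequence of XHTML SAX events sent to the given content handler" means for the caller, here is a minimal sketch of a handler that gathers plain text from such events. It uses only the JDK. The class name TextCollector and the helper collect are hypothetical and not part of Tika. Because no extractor is available in a standalone example, the JDK SAX parser stands in as the event source and replays a fixed XHTML string.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler (not part of Tika): collects character data
// from XHTML SAX events and ends each paragraph with a newline.
class TextCollector extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();

    @Override
    public void characters(char[] ch, int start, int length) {
        // May be called several times per text node; appending handles that.
        text.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        // Treat the end of a <p> element as a line break.
        if ("p".equals(qName)) {
            text.append('\n');
        }
    }

    String getText() {
        return text.toString();
    }

    // Stand-in for an extractor: the JDK SAX parser turns the XHTML
    // string into SAX events and sends them to the handler.
    static String collect(String xhtml) throws Exception {
        TextCollector handler = new TextCollector();
        SAXParserFactory.newInstance()
                .newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.getText();
    }

    public static void main(String[] args) throws Exception {
        System.out.print(collect(
                "<html><body><p>Hello</p><p>World</p></body></html>"));
    }
}
```

With Tika on the classpath, a handler like this could presumably be passed as the handler argument of getXHTML, together with a Metadata and a ParseContext instance. How an extractor instance is obtained is not covered by this page.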