Package org.apache.tika.parser.microsoft
Class POIFSContainerDetector
java.lang.Object
org.apache.tika.parser.microsoft.POIFSContainerDetector
- All Implemented Interfaces:
Serializable,Detector
@Deprecated(since="2026-04-30")
public class POIFSContainerDetector
extends Object
implements Detector
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
A detector that works on a POIFS OLE2 document
to figure out exactly what the file is.
This should work for all OLE2 documents, whether
they are ones supported by POI or not.
- See Also:
-
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final MediaTypeDeprecated.Some other kind of embedded document, in a CompObj container within another OLE2 documentstatic final MediaTypeDeprecated.Microsoft Wordstatic final MediaTypeDeprecated.General embedded document type within an OLE2 containerstatic final MediaTypeDeprecated.Hangul Word Processor (Korean)static final MediaTypeDeprecated.Microsoft Projectstatic final MediaTypeDeprecated.Equation embedded in Office docsstatic final MediaTypeDeprecated.Graph/Charts embedded in PowerPoint and Excelstatic final MediaTypeDeprecated.Microsoft Outlookstatic final MediaTypeDeprecated.The OLE base file formatstatic final MediaTypeDeprecated.An OLE10 Native embedded document within another OLE2 documentstatic final MediaTypeDeprecated.The protected OOXML base file formatstatic final MediaTypeDeprecated.Microsoft PowerPointstatic final MediaTypeDeprecated.Microsoft Publisherstatic final MediaTypeDeprecated.Base QuattroPro mimestatic final MediaTypeDeprecated.StarOffice Drawstatic final MediaTypeDeprecated.StarOffice Calcstatic final MediaTypeDeprecated.StarOffice Impressstatic final MediaTypeDeprecated.StarOffice Writerstatic final MediaTypeDeprecated.SolidWorks CAD filestatic final MediaTypeDeprecated.Microsoft Visiostatic final MediaTypeDeprecated.Microsoft Worksstatic final MediaTypeDeprecated.Microsoft Works Spreadsheet 7.0static final MediaTypeDeprecated.Microsoft Excel -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptiondetect(InputStream input, Metadata metadata) Deprecated.Detects the content type of the given input document.voidsetMarkLimit(int markLimit) Deprecated.
-
Field Details
-
OLE
Deprecated.The OLE base file format -
OOXML_PROTECTED
Deprecated.The protected OOXML base file format -
GENERAL_EMBEDDED
Deprecated.General embedded document type within an OLE2 container -
OLE10_NATIVE
Deprecated.An OLE10 Native embedded document within another OLE2 document -
COMP_OBJ
Deprecated.Some other kind of embedded document, in a CompObj container within another OLE2 document -
MS_GRAPH_CHART
Deprecated.Graph/Charts embedded in PowerPoint and Excel -
MS_EQUATION
Deprecated.Equation embedded in Office docs -
XLS
Deprecated.Microsoft Excel -
DOC
Deprecated.Microsoft Word -
PPT
Deprecated.Microsoft PowerPoint -
PUB
Deprecated.Microsoft Publisher -
VSD
Deprecated.Microsoft Visio -
WPS
Deprecated.Microsoft Works -
XLR
Deprecated.Microsoft Works Spreadsheet 7.0 -
MSG
Deprecated.Microsoft Outlook -
MPP
Deprecated.Microsoft Project -
SDC
Deprecated.StarOffice Calc -
SDA
Deprecated.StarOffice Draw -
SDD
Deprecated.StarOffice Impress -
SDW
Deprecated.StarOffice Writer -
SLDWORKS
Deprecated.SolidWorks CAD file -
HWP
Deprecated.Hangul Word Processor (Korean) -
QUATTROPRO
Deprecated.Base QuattroPro mime
-
-
Constructor Details
-
POIFSContainerDetector
public POIFSContainerDetector()Deprecated.
-
-
Method Details
-
setMarkLimit
public void setMarkLimit(int markLimit) Deprecated. -
detect
Deprecated.Description copied from interface:DetectorDetects the content type of the given input document. Returnsapplication/octet-streamif the type of the document can not be detected.If the document input stream is not available, then the first argument may be
null. Otherwise the detector may read bytes from the start of the stream to help in type detection. The given stream is guaranteed to support themark featureand the detector is expected tomarkthe stream before reading any bytes from it, and toresetthe stream before returning. The stream must not be closed by the detector.The given input metadata is only read, not modified, by the detector.
- Specified by:
detectin interfaceDetector- Parameters:
input- document input stream, ornullmetadata- input metadata for the document- Returns:
- detected media type, or
application/octet-stream - Throws:
IOException- if the document input stream could not be read
-