Package org.apache.tika.metadata
Interface TikaCoreProperties
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
Contains a core set of basic Tika metadata properties, which all parsers
will attempt to supply (where the file format permits). These are all
defined in terms of other standard namespaces.
Users of Tika who wish to have consistent metadata across file formats
can make use of these Properties, knowing that where present they will
have consistent semantic meaning between different file formats. (No
matter if one file format calls it Title, another Long-Title and another
Long-Name, if they all mean the same thing as defined by
DublinCore.TITLE then they will all be present as such)
For now, most of these properties are composite ones including the deprecated
non-prefixed String properties from the Metadata class. In Tika 2.0, most
of these will revert back to simple assignments.- Since:
- Apache Tika 1.2
-
Nested Class Summary
Nested ClassesModifier and TypeInterfaceDescriptionstatic enumDeprecated.A file might contain different types of embedded documents. -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.This is currently used to identify Content-Type that may be included within a document, such as in html documents (e.g.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.Embedded resource type propertystatic final StringDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.DublinCore.SUBJECT; should include both subject and keywords if a document format has both.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.Some file formats can store information about their original file name/location or about their attachment's original file name/location.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.static final PropertyDeprecated.Use this to store exceptions caught while trying to read the stream of an embedded resource.static final StringDeprecated.Use this to store parse exception information in the Metadata object.static final PropertyDeprecated.Use this to store exceptions caught during a parse that are non-fatal, e.g.static final StringDeprecated.Use this to prefix metadata properties that store information about the parsing process.static final PropertyDeprecated.static final PropertyDeprecated.use TikaCoreProperties#KEYWORDSstatic final PropertyDeprecated.use TikaCoreProperties#DESCRIPTIONstatic final PropertyDeprecated.use TikaCoreProperties#TITLEstatic final PropertyDeprecated.use OfficeOpenXMLCore#SUBJECTstatic final PropertyDeprecated.
-
Field Details
-
TIKA_META_PREFIX
Deprecated.Use this to prefix metadata properties that store information about the parsing process. Users should be able to distinguish between metadata that was contained within the document and metadata about the parsing process. In Tika 2.0 (or earlier?), let's change X-ParsedBy to X-TIKA-Parsed-By.- See Also:
-
TIKA_META_EXCEPTION_PREFIX
Deprecated.Use this to store parse exception information in the Metadata object.- See Also:
-
TIKA_META_EXCEPTION_WARNING
Deprecated.Use this to store exceptions caught during a parse that are non-fatal, e.g. if a parser is in lenient mode and more content can be extracted if we ignore an exception thrown by a dependency. -
TIKA_META_EXCEPTION_EMBEDDED_STREAM
Deprecated.Use this to store exceptions caught while trying to read the stream of an embedded resource. Do not use this if there is a parse exception on the embedded resource. -
EMBEDDED_RESOURCE_TYPE_KEY
Deprecated.- See Also:
-
ORIGINAL_RESOURCE_NAME
Deprecated.Some file formats can store information about their original file name/location or about their attachment's original file name/location. -
CONTENT_TYPE_HINT
Deprecated.This is currently used to identify Content-Type that may be included within a document, such as in html documents (e.g. ) , or the value might come from outside the document. This information may be faulty and should be treated only as a hint. -
CONTENT_TYPE_OVERRIDE
Deprecated. -
FORMAT
Deprecated.- See Also:
-
IDENTIFIER
Deprecated.- See Also:
-
CONTRIBUTOR
Deprecated.- See Also:
-
COVERAGE
Deprecated.- See Also:
-
CREATOR
Deprecated.- See Also:
-
MODIFIER
Deprecated.- See Also:
-
CREATOR_TOOL
Deprecated.- See Also:
-
LANGUAGE
Deprecated.- See Also:
-
PUBLISHER
Deprecated.- See Also:
-
RELATION
Deprecated.- See Also:
-
RIGHTS
Deprecated.- See Also:
-
SOURCE
Deprecated.- See Also:
-
TYPE
Deprecated.- See Also:
-
TITLE
Deprecated.- See Also:
-
DESCRIPTION
Deprecated.- See Also:
-
KEYWORDS
Deprecated.DublinCore.SUBJECT; should include both subject and keywords if a document format has both. See alsoOffice.KEYWORDSandOfficeOpenXMLCore.SUBJECT. -
CREATED
Deprecated.- See Also:
-
MODIFIED
Deprecated.- See Also:
-
PRINT_DATE
Deprecated.- See Also:
-
METADATA_DATE
Deprecated.- See Also:
-
LATITUDE
Deprecated.- See Also:
-
LONGITUDE
Deprecated.- See Also:
-
ALTITUDE
Deprecated.- See Also:
-
RATING
Deprecated.- See Also:
-
COMMENTS
Deprecated.- See Also:
-
TRANSITION_KEYWORDS_TO_DC_SUBJECT
Deprecated.use TikaCoreProperties#KEYWORDS- See Also:
-
TRANSITION_SUBJECT_TO_DC_DESCRIPTION
Deprecated.use TikaCoreProperties#DESCRIPTION- See Also:
-
TRANSITION_SUBJECT_TO_DC_TITLE
Deprecated.use TikaCoreProperties#TITLE- See Also:
-
TRANSITION_SUBJECT_TO_OO_SUBJECT
Deprecated.use OfficeOpenXMLCore#SUBJECT- See Also:
-
EMBEDDED_RESOURCE_TYPE
Deprecated.Embedded resource type property -
HAS_SIGNATURE
Deprecated.
-