Class WordMLParser

All Implemented Interfaces:
Serializable, Parser

@Deprecated(since="2026-04-30") public class WordMLParser extends AbstractXML2003Parser
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
Parses wordml 2003 format word files. These are single xml files that predate ooxml. See https://en.wikipedia.org/wiki/Microsoft_Office_XML_formats
See Also:
  • Constructor Details

    • WordMLParser

      public WordMLParser()
      Deprecated.
  • Method Details

    • getSupportedTypes

      public Set<MediaType> getSupportedTypes(ParseContext context)
      Deprecated.
      Description copied from interface: Parser
      Returns the set of media types supported by this parser when used with the given parse context.
      Parameters:
      context - parse context
      Returns:
      immutable set of media types
    • setContentType

      public void setContentType(Metadata metadata)
      Deprecated.