Interface DocumentSelector


@Deprecated(since="2026-04-30") public interface DocumentSelector
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
Interface for different document selection strategies for purposes like embedded document extraction by a ContainerExtractor instance. An implementation of this interface defines some specific selection criteria to be applied against the document metadata passed to the select(Metadata) method.
Since:
Apache Tika 0.8
  • Method Summary

    Modifier and Type
    Method
    Description
    boolean
    select(Metadata metadata)
    Deprecated.
    Checks if a document with the given metadata matches the specified selection criteria.
  • Method Details

    • select

      boolean select(Metadata metadata)
      Deprecated.
      Checks if a document with the given metadata matches the specified selection criteria.
      Parameters:
      metadata - document metadata
      Returns:
      true if the document matches the selection criteria, false otherwise