Class RegexUtils

java.lang.Object
org.apache.tika.utils.RegexUtils

@Deprecated(since="2026-04-30") public class RegexUtils extends Object
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
Inspired from Nutch code class OutlinkExtractor. Apply regex to extract content
  • Constructor Details

    • RegexUtils

      public RegexUtils()
      Deprecated.
  • Method Details

    • extractLinks

      public static List<String> extractLinks(String content)
      Deprecated.
      Extract urls from plain text.
      Parameters:
      content - The plain text content to examine
      Returns:
      List of urls within found in the plain text