Package org.apache.tika.utils
Class RegexUtils
java.lang.Object
org.apache.tika.utils.RegexUtils
Deprecated.
This version of the Apache Tika library is deprecated. Use your own version of Apache Tika.
Inspired from Nutch code class OutlinkExtractor. Apply regex to extract
content
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionextractLinks(String content) Deprecated.Extract urls from plain text.
-
Constructor Details
-
RegexUtils
public RegexUtils()Deprecated.
-
-
Method Details
-
extractLinks
Deprecated.Extract urls from plain text.- Parameters:
content- The plain text content to examine- Returns:
- List of urls within found in the plain text
-