Apache Tika Java API

The Tika Java API provides the AutoDetectParser for automatic format detection and parsing, Metadata class for reading extracted metadata fields, ContentHandler for streaming SAX-based text extraction, and Detector for MIME type identification. The facade Tika class provides a simple one-line API for text extraction from any supported format.

API entry from apis.yml

apis.yml Raw ↑
aid: apache-tika:apache-tika-java-api
name: Apache Tika Java API
description: The Tika Java API provides the AutoDetectParser for automatic format detection and parsing,
  Metadata class for reading extracted metadata fields, ContentHandler for streaming SAX-based text extraction,
  and Detector for MIME type identification. The facade Tika class provides a simple one-line API for
  text extraction from any supported format.
humanURL: https://tika.apache.org/
tags:
- Java
- Content Extraction
- Parser
- Metadata
properties:
- type: Documentation
  url: https://tika.apache.org/
- type: APIReference
  url: https://tika.apache.org/1.28/api/
- type: SDK
  url: https://search.maven.org/search?q=org.apache.tika
  title: Maven Java SDK