Back to Tools
Claim This Listing



E-Discovery
Apache-2.0
apacheApache Tika
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Jurisdiction
Global
License
Apache-2.0
Stars
3,683
Last Updated
4/6/2026
Why this tool is worth evaluating
This page surfaces the signals a team needs before deciding: ownership, jurisdiction, licensing, freshness, and popularity — without sending them straight into docs or GitHub first.
Quick signals
- Category: E-Discovery
- Jurisdiction: Global
- Owner: apache
Claim This Listing
Own this tool? Claim your listing to update details and add information.
Related Tools
doccano
Open source annotation tool for machine learning practitioners.
10,601Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
26,937FreeEed
Complete eDiscovery processing (OCR, indexing, metadata)
127FreeDiscovery
Information retrieval engine based on scikit-learn
143