Add optional OCR so scanned documents and text-in-image become findable.
What to do: wire a pluggable OCR step into the ingest pipeline (keepstack/ingest.py) that runs Tesseract when it is available and writes the extracted text into the FTS index. Keep it optional like the AI features, with no hard dependency.
Acceptance: uploading an image with visible text makes that text searchable, and the feature is a no-op when Tesseract is not installed.
Add optional OCR so scanned documents and text-in-image become findable.
What to do: wire a pluggable OCR step into the ingest pipeline (
keepstack/ingest.py) that runs Tesseract when it is available and writes the extracted text into the FTS index. Keep it optional like the AI features, with no hard dependency.Acceptance: uploading an image with visible text makes that text searchable, and the feature is a no-op when Tesseract is not installed.