Support OCRing images inside of PDFs.

complete

Alex MacCaw

Currently, only PDFs that contain embedded text are processed correctly and added to the search index. We should also OCR PDFs that consist entirely of images or that contain images alongside text.

January 17, 2025

Alex MacCaw marked this post as complete

Václav Vančura

Are you using standard OCR? Have you thought about using Docling for more effective transcription?

Alex MacCaw marked this post as in progress

Alex MacCaw marked this post as planned