Quality-gated workflow

Scan Rescue and OCR Readiness

Use Scan Rescue when text extraction failed, a PDF appears scanned, or an image contains text that needs review. It helps decide whether OCR is worth running and how to verify output; it does not repair scans or promise recognition accuracy.

Start here

Choose the next recovery path

Preflight result

Scan Rescue readiness

Use these signals before running server-backed OCR. Ready means the file is a reasonable OCR candidate with a review plan, not that recognition will be perfect.

Ready signal

Ready for OCR attempt

The file has no usable text layer, scan-quality signals are acceptable, page rotation and document language are known, and you have a checklist for reviewing names, numbers, and dates.

Needs review

Use text extraction first

If the PDF has selectable text or digital text-layer signals, PDF to Text is usually cleaner and cheaper than OCR.

Needs review

Needs scan cleanup review

Low contrast, rotation, tiny text, blank pages, mixed page sizes, or photo glare should be reviewed before you spend a server OCR job on the file.

Needs review

Needs output verification

OCR output should be checked against the source for names, totals, punctuation, line breaks, and table structure before reuse.
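
The four signals above can be read as an ordered set of checks. The following is a minimal sketch of that ordering in Python, not the tool's actual implementation: the Preflight class, its field names, and the readiness function are hypothetical. Mapping a missing review checklist to "Needs output verification" is also an assumption made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Preflight:
    """Hypothetical preflight signals; field names are illustrative only."""
    has_text_layer: bool                      # selectable or digital text found
    quality_issues: List[str] = field(default_factory=list)  # e.g. "low contrast"
    rotation_known: bool = True
    language_known: bool = True
    has_review_checklist: bool = False        # plan for names, numbers, dates


def readiness(p: Preflight) -> str:
    """Return one of the four readiness labels described above."""
    # A usable text layer makes plain text extraction the first choice.
    if p.has_text_layer:
        return "Use text extraction first"
    # Scan-quality problems or unknown rotation/language call for review first.
    if p.quality_issues or not (p.rotation_known and p.language_known):
        return "Needs scan cleanup review"
    # Assumption: without a review plan, output verification is still open.
    if not p.has_review_checklist:
        return "Needs output verification"
    return "Ready for OCR attempt"


if __name__ == "__main__":
    scan = Preflight(has_text_layer=False, quality_issues=["photo glare"])
    print(readiness(scan))
```

Checking for a text layer first reflects the point that text extraction is usually cleaner and cheaper than OCR. Even a "Ready for OCR attempt" result only means the file is a reasonable candidate, so the output still needs checking against the source.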

Task groups

Start with the part of the workflow that is failing