What does OCR actually do?+
OCR (Optical Character Recognition) reads the pixels of a scanned page and identifies which characters they form, producing machine-readable text. Filoraio's OCR adds that text as an invisible layer to your original PDF — the visible page looks exactly the same, but now Ctrl+F works, you can copy text out, and search engines can index the content.
Is this OCR tool really free, with no signup?+
Yes. No account, no email, no daily quota, and no watermark on the output. The page is supported by ads — never the file you download. OCR as many PDFs as you need.
Are my files uploaded somewhere?+
No. The OCR engine (an open-source OCR engine compiled to WebAssembly) runs entirely in your browser. The PDF is held in your device's RAM while processed and downloads directly back to you. Filoraio's servers never see the file or its text content.
What languages does Filoraio's OCR support?+
Twelve languages: English, Spanish, French, German, Italian, Portuguese, Dutch, Russian, Arabic, Hindi, Chinese (Simplified), and Japanese. The first OCR run in any language downloads a ~10–15 MB OCR language model and caches it in your browser — subsequent runs in the same language are instant.
How accurate is the OCR?+
On clean, well-scanned text at 200+ DPI, accuracy is typically 95–99% for Latin scripts and 90–98% for non-Latin scripts. Lower-quality scans (phone snaps, faxes, low contrast) drop to 70–90%. The underlying OCR engine is the open-source engine behind Google Books and many other large-scale OCR projects, so it's well-tested but not flawless on edge cases.
How long does OCR take?+
3–10 seconds per page on a typical modern laptop, plus a one-time ~5 second model download the first time you use a language. A 50-page document at 5 seconds/page takes ~4 minutes total. Older devices and phones are slower (10–15 seconds per page), but everything runs in the background while you keep using your browser.
Will the output PDF look the same as the original?+
Yes. The original page image is preserved exactly — OCR adds an invisible text layer underneath the pixels, the same way Adobe's Searchable Image output does. Visually, the PDF is identical to the source. The difference: now you can select, copy, and search the text.
Will the file get bigger after OCR?+
Marginally. The invisible text layer adds a small fraction (typically 1–5%) of the source's size — much less than rasterising would, and far less than converting to Word and back. For a 10 MB scanned PDF, expect a ~10.2–10.5 MB output.
Can I OCR a PDF on my iPhone or Android phone?+
Yes. The OCR runs in your phone's browser — Safari on iOS, Chrome on Android. Long documents take longer than on a desktop (phone CPUs are slower) but everything works the same. The OCR'd PDF saves directly to Files (iOS) or Downloads (Android).
What if my PDF is already searchable?+
OCR will still run, but adds a redundant text layer underneath the existing one — slightly larger file with no functional gain. Test first: open the PDF, try Ctrl+F or click to select text. If it works, the PDF is already searchable and you can skip this tool.
Can I OCR a password-protected PDF?+
Filoraio handles owner-restricted PDFs (printing/copying locks) automatically. For PDFs with user passwords (encryption requiring a password to open), unlock the file first with our Unlock PDF tool, then OCR the unlocked output here.
Can the OCR'd text be edited?+
The text is searchable and copyable in any PDF reader — you can Ctrl+F to find it, select it with your cursor, and Cmd+C/Ctrl+C to paste into another app. To edit it visibly inside the PDF itself, run the output through our PDF to Word tool for a fully editable .docx version.
What's the maximum PDF I can OCR?+
There's no hard cap. The OCR runs in your browser's memory — the practical limit is your device's RAM. Most browsers handle 100–200 page documents without issue. For very long scans (500+ pages), the limit is patience: ~50 minutes of processing at 6 seconds per page.
Will OCR work on handwritten text?+
Our OCR recognises *printed* text — handwriting is a different machine-learning problem with its own dedicated tools (HTR, or Handwritten Text Recognition). Some very neat block-printing might be recognised, but cursive and typical handwriting will produce garbled output. For handwritten documents, dedicated HTR services like Transkribus are the right tool.