How to OCR a Scanned PDF and Make It Searchable

A scanned PDF looks like a normal document, but to your computer it is just a stack of photographs. You cannot search it, you cannot select a sentence to copy, and screen readers cannot read it aloud. That gap between “looks like text” and “is text” is exactly what OCR — optical character recognition — closes.

This guide walks you through running OCR on a scanned PDF with Doqnest’s OCR tool, entirely in your browser. Your document is processed on your own device, nothing is uploaded to a server, and you do not need an account to try it.

Run OCR on a scanned PDF in four steps

Doqnest detects scanned pages automatically, so the whole process comes down to opening the file and clicking one button.

  1. Open the OCR PDF tool and select your scanned PDF, or drag and drop it onto the page.
  2. The editor opens and flags the pages it recognizes as scans — image-only pages with no text layer behind them.
  3. Click Run OCR. The recognition runs in your browser on the flagged pages and adds an invisible text layer on top of each scanned image, so the page still looks exactly the same but now contains real text.
  4. Click Download to save the searchable PDF to your device.
Tip: try searching the downloaded file (Ctrl+F or Cmd+F) for a word you can see on a scanned page. If it is found, the OCR worked — the page is now a searchable PDF.

What OCR actually does to your document

OCR does not replace the scanned image — it reads it. The software analyzes the picture of each page, recognizes the shapes of letters and words, and writes what it found into a hidden text layer positioned precisely under the printed characters. The visible page is untouched; the difference is entirely in what your computer can now do with it.

After OCR, a scanned page behaves like a born-digital one: you can search it, select and copy passages, and let accessibility tools read it. If you want a deeper look at how the recognition works and where it came from, see what OCR is and how it works.

How to make a searchable PDF from paper documents

Often the scanned PDF does not exist yet — you have paper, or a handful of phone photos. The workflow is the same once you get everything into one PDF: photograph or scan each page, combine the images into a single document, then run OCR on the result.

Doqnest handles the combining step too: the walkthrough in combining scanned pages into one PDF shows how to turn a folder of JPGs into one clean document. Once your pages are assembled, open the file in the OCR tool and run recognition on the whole thing in one pass.

Getting the best recognition accuracy

OCR quality is mostly scan quality. A crisp, straight, well-lit page recognizes almost perfectly; a dim, skewed phone photo produces errors. A few habits make a large difference:

  • Scan at 300 DPI or photograph in good light. Low-resolution or blurry images are the number-one cause of misread characters.
  • Keep pages straight. Rotate sideways pages in the editor before running OCR — recognition works line by line and assumes upright text.
  • Use flat pages. Curved text near a book spine and creased folds distort letter shapes.
  • Prefer clean originals. Highlighter marks, stamps, and handwriting over printed text confuse the recognizer; printed text on plain background works best.

Is it safe to OCR confidential scans online?

Scanned documents are often the sensitive ones — contracts, medical records, ID pages, old tax paperwork. With many online OCR services, “online” means your scan is uploaded to a server, processed there, and stored at least temporarily on someone else’s machine.

Doqnest takes a different approach: recognition runs inside your browser using your own device’s processing power. The file never leaves your computer at any point, which makes browser-based OCR a sound choice even for paperwork you would not email to a stranger.

What to do after OCR

A searchable PDF is usually the goal, but it is also a starting point. Because the text is now real, you can copy quotes out of an old report instead of retyping them, search a hundred-page scan for a name in seconds, and archive documents knowing you will actually find them again.

The recognized document also plays well with the rest of your workflow: merge it with born-digital files, sign it, or shrink it for sending. Since scans are stored at print resolution, they tend to be heavy — reducing the file size for email after OCR is a common final step.

الأسئلة الشائعة

How do I know if my PDF needs OCR?

Try to select some text with your mouse, or search for a word you can see on the page. If nothing can be selected or found, the page is an image-only scan. Doqnest also flags scanned pages automatically when you open a file and offers to run OCR on them.

Does OCR change how my document looks?

No. OCR adds an invisible text layer underneath the scanned image; the visible page stays pixel-for-pixel the same. The only difference is that you can now search, select, and copy the text.

Is the OCR free, and is there a limit?

You can run OCR without creating an account, and free downloads carry a small Doqnest watermark. The number of pages you can OCR depends on your plan — paid plans, which start with a free trial, include larger page allowances and watermark-free downloads.

Can OCR read handwriting?

Printed and typewritten text is where OCR shines. Neat block handwriting is sometimes recognized, but cursive and quick notes are unreliable. For handwritten pages, treat OCR results as a rough draft to check against the original.

What languages does OCR work with?

Recognition works with standard printed Latin-script text out of the box, which covers most everyday documents. Accuracy is highest with clean, high-resolution scans regardless of language — see what OCR is for more on how recognition handles different scripts.