What languages are supported?

English, Simplified Chinese, English + Chinese combined, and Japanese. The language model is downloaded once and cached in your browser.

Can I OCR a scanned PDF?

Yes. The tool accepts both images and PDF files. For PDFs, each page is rendered as an image and then processed by the OCR engine.

How accurate is the OCR?

Accuracy depends on image quality. Clear, high-contrast printed text at 150+ DPI typically achieves 90–99% accuracy. Handwritten text, low-resolution scans, or unusual fonts may produce lower accuracy.

Is my data sent to any server?

No. The entire OCR process runs locally in your browser using WebAssembly. Your images and text never leave your device.

Why is the first use slow?

The OCR engine and language data (about 4 MB for the English model) are downloaded on first use and then cached in your browser, so subsequent uses start almost instantly.

What languages does the OCR support?

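The underlying Tesseract engine has trained models for many languages, including English, Chinese, Japanese, Korean, Spanish, French, and German. This tool currently offers English, Simplified Chinese, English + Chinese combined, and Japanese.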

How accurate is the text recognition?

Accuracy depends on image quality. Clean, high-resolution images achieve 95%+ accuracy. Blurry, rotated, or low-contrast images may produce more errors.

Can I OCR a multi-page PDF?

Yes. Each page is processed individually and the extracted text from all pages is combined in the output.
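
A minimal sketch of that per-page loop, assuming each page has already been rendered to an image. `recognizePage` is a hypothetical callback standing in for the OCR engine call (it is not FileKit's actual code), and the blank-line page separator is an arbitrary choice for illustration.

```typescript
// Hypothetical stand-in for one OCR call: takes a rendered page, resolves to its text.
type RecognizePage<Page> = (page: Page) => Promise<string>;

// Process pages one at a time, in order, then join the results.
async function ocrAllPages<Page>(
  pages: Page[],
  recognizePage: RecognizePage<Page>
): Promise<string> {
  const texts: string[] = [];
  for (const page of pages) {
    // Sequential on purpose: only one page is being recognized at a time.
    const text = await recognizePage(page);
    texts.push(text.trim());
  }
  // Blank line between pages (an assumed output convention).
  return texts.join("\n\n");
}
```

Passing the recognizer in as a parameter keeps the loop independent of any particular OCR library, so the same function works with a stub in tests.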

Extract Text (OCR) Beta

Your files never leave your device

FileKit — Free browser-based file tools. No upload, no signup.

Max 50 MB per file. Supported formats: JPG, PNG, WebP, BMP, TIFF, PDF.

How OCR works

FileKit uses Tesseract.js, a WebAssembly port of the Tesseract OCR engine, to recognize text entirely in your browser. The language model is downloaded once (~4 MB for English) and cached locally; nothing is uploaded. For best results, use high-contrast images with clearly printed text at a resolution of at least 150 DPI.

Understanding OCR Technology

What Is OCR?

Optical Character Recognition (OCR) is technology that converts images of text — whether from scanned documents, photographs, or screenshots — into machine-readable, editable text. The process involves analyzing the visual patterns in an image, recognizing character shapes, and outputting the corresponding text. Modern OCR engines like Tesseract.js use trained neural network models to achieve high accuracy across multiple languages and fonts.

When Do You Need OCR?

Digitizing scanned documents — convert paper records, receipts, and contracts into searchable, editable text files.
Extracting text from screenshots — pull text from app screenshots, error messages, or social media posts without retyping.
Invoice processing — extract vendor names, amounts, and dates from invoice images for accounting workflows.
Academic research — digitize passages from printed books or journals for citation and note-taking.
Accessibility — convert image-based content into text that screen readers can process for visually impaired users.

Key Aspects of OCR

Multi-language support — recognizes text in English, Chinese, Japanese, Korean, and dozens of other languages with trained models.
Confidence scoring — each recognized word includes a confidence percentage, letting you identify uncertain readings.
Browser-based processing — no server upload. The OCR model runs locally in your browser using WebAssembly.
Format flexibility — accepts JPEG, PNG, BMP, TIFF, and other common image formats as input.
Copy and export — extracted text can be copied to clipboard or downloaded as a plain text file.

Tips for Better OCR Results

Use high-resolution images

OCR accuracy depends heavily on image resolution. Images at 300 DPI or higher produce the best results. Photos taken with a phone camera at arm's length are usually sufficient.
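
To check whether a scan meets that bar, effective resolution can be estimated from the pixel dimensions and the physical size of the page. The function name and the A4 example below are illustrative assumptions, not part of the tool.

```typescript
// Effective DPI = pixels along one edge / physical length of that edge in inches.
function effectiveDpi(
  widthPx: number,
  heightPx: number,
  widthIn: number,
  heightIn: number
): number {
  if (widthIn <= 0 || heightIn <= 0) {
    throw new RangeError("Physical dimensions must be positive");
  }
  // Use the lower of the two axes so a stretched image is not overrated.
  return Math.min(widthPx / widthIn, heightPx / heightIn);
}

const MM_PER_INCH = 25.4;
// A4 paper is 210 mm x 297 mm.
const a4WidthIn = 210 / MM_PER_INCH;
const a4HeightIn = 297 / MM_PER_INCH;

// A 2480 x 3508 px image of a full A4 page works out to roughly 300 DPI.
const a4ScanDpi = effectiveDpi(2480, 3508, a4WidthIn, a4HeightIn);
```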

Ensure good contrast

Dark text on a light background works best. If your image has poor contrast, try adjusting brightness and contrast before running OCR.
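
One simple way to adjust contrast is a linear stretch on grayscale pixel values. This is a generic image-processing sketch, not FileKit's own preprocessing; the function name is an assumption, and the input is assumed to be one grayscale byte per pixel.

```typescript
// Linearly rescale grayscale values so the darkest pixel maps to 0
// and the brightest maps to 255.
function stretchContrast(gray: Uint8ClampedArray): Uint8ClampedArray {
  let min = 255;
  let max = 0;
  gray.forEach((v) => {
    if (v < min) min = v;
    if (v > max) max = v;
  });
  const range = max - min;
  if (range === 0) {
    // Flat image: nothing to stretch, return a copy.
    return gray.slice();
  }
  // Uint8ClampedArray rounds and clamps each result into 0..255.
  return gray.map((v) => ((v - min) * 255) / range);
}
```

For example, gray text at value 100 on a background at value 200 becomes black text (0) on a white background (255).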

Select the correct language

OCR models are language-specific, so selecting the right language dramatically improves accuracy. For documents that mix English and Chinese, use the combined English + Chinese option; for other language mixes, run OCR once per language.
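
Tesseract identifies each language model by a short code, and a combined option such as English + Chinese corresponds to codes joined with `+`. The sketch below uses standard Tesseract language codes; the option keys are hypothetical labels for this tool's four choices, not its actual identifiers.

```typescript
// Hypothetical keys for the four language choices offered by the tool.
type LanguageOption =
  | "english"
  | "chineseSimplified"
  | "englishAndChinese"
  | "japanese";

// Standard Tesseract language codes for each option.
const LANGUAGE_CODES: Record<LanguageOption, string[]> = {
  english: ["eng"],
  chineseSimplified: ["chi_sim"],
  englishAndChinese: ["eng", "chi_sim"],
  japanese: ["jpn"],
};

// Tesseract accepts several models at once as a "+"-separated string.
function toTesseractLang(option: LanguageOption): string {
  return LANGUAGE_CODES[option].join("+");
}
```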

Straighten skewed scans

If your scanned document is tilted, straighten it before OCR. Even a few degrees of skew can reduce accuracy by 10–20%.

Review confidence scores

Words with low confidence scores (below 70%) are likely incorrect. Review and manually correct these after extraction.
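
A minimal sketch of that review step. It assumes each recognized word is available as an object with `text` and a `confidence` value from 0 to 100, which is how Tesseract.js reports word confidence, though the exact result structure may differ between versions. The function names and the default threshold of 70 are illustrative.

```typescript
interface OcrWord {
  text: string;
  confidence: number; // 0-100, higher is more certain
}

// Return the words that fall below the threshold so they can be checked by hand.
function wordsToReview(words: OcrWord[], threshold: number = 70): OcrWord[] {
  return words.filter((w) => w.confidence < threshold);
}

// Wrap uncertain words in brackets to make them easy to spot in the output.
function markUncertain(words: OcrWord[], threshold: number = 70): string {
  return words
    .map((w) => (w.confidence < threshold ? `[${w.text}?]` : w.text))
    .join(" ");
}
```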

OCR: How It Compares

OCR technology is available through various platforms. Here is how browser-based OCR compares to alternatives.

Feature	FileKit	Cloud / other OCR services
Privacy	Images never leave your browser	Upload to cloud OCR services
Language support	60+ languages via Tesseract.js	Varies, some require paid API
Accuracy	High for clean, high-res documents	Commercial APIs slightly better for noisy input
Speed	Depends on image size and device	Server-side may be faster for large batches
Cost	Free, unlimited usage	Google Vision, AWS Textract charge per page
Offline capable	Works after initial model download	Requires internet connection

How to OCR a Document

1. Upload an image or scanned PDF
   Drag and drop a scanned document, photo of a page, or screenshot. Supported formats include JPG, PNG, WebP, and PDF.
2. Select the language
   Choose the primary language of the document: English, Chinese (Simplified), Japanese, or English+Chinese combined. Correct language selection improves accuracy significantly.
3. Extract and copy text
   FileKit runs Tesseract.js (WebAssembly OCR) entirely in your browser. The recognized text appears in an editable area; copy it or download it as a .txt file.

Your Privacy, Guaranteed

FileKit processes every file directly in your browser using JavaScript and WebAssembly. Your documents, images, and data never leave your device — there is no server upload, no cloud storage, and no account required. What happens in your browser stays in your browser.

100% client-side processing
No server upload — ever
No account or signup needed
Works offline after first load