What image formats are supported?

JPG, PNG, WebP, BMP, and TIFF. For best OCR accuracy, use a high-resolution image with clear, well-lit text.

What languages does image-to-text support?

English, Simplified Chinese, English + Chinese combined, and Japanese. The language model is downloaded once and cached in your browser.

How accurate is the text extraction?

Accuracy depends on image quality. Clear, high-contrast printed text at 150+ DPI typically achieves 90–99% accuracy. Handwritten text, low-resolution photos, or unusual fonts may produce lower accuracy.

Can I extract text from a screenshot?

Yes. Screenshots typically have high contrast and clean text, which makes them ideal for OCR. Simply upload the screenshot and the tool will extract the text.

Is my data sent to any server?

No. The entire OCR process runs locally in your browser using WebAssembly. Your images and extracted text never leave your device.

Why is the first use slower?

The OCR engine and language data (~4 MB) need to be downloaded on first use. This data is cached in your browser, so subsequent uses start almost instantly.

Can I copy and paste the extracted text?

Yes. The extracted text appears in an editable text area. Use the Copy button to copy it to your clipboard, or download it as a .txt file.

What about scanned PDFs?

For scanned PDFs, use the full OCR tool which supports both images and PDF files. This image-to-text tool is optimised for individual images and photos.

Image to Text

Your files never leave your device

Language

Drop files here

or click to browse

Max 50.0 MB per file·Supports: JPG · PNG · WebP · BMP · TIFF

You might also need

🔍

Extract Text (OCR)

Pull text from images — beta

📝

PDF to Text

Pull all text out of a PDF as plain .txt

📉

Compress Image

Shrink JPG, PNG, WebP files with quality and resize controls

How image-to-text extraction works

FileKit uses Tesseract.js, a WebAssembly port of the Tesseract OCR engine, to recognise text entirely in your browser. Upload a photo, screenshot, or scanned document and the engine identifies letters, words, and paragraphs automatically. The language model is downloaded once (~4 MB for English) and cached locally — nothing is uploaded. For best results, use high-contrast images with clearly printed text at a resolution of at least 150 DPI.

How to Extract Text from an Image

1
Upload an image
Drag and drop a photo, screenshot, or scanned document. Supported formats include JPG, PNG, WebP, BMP, and TIFF.
2
Select the language
Choose the primary language of the text: English, Chinese (Simplified), Japanese, or English+Chinese combined. Correct language selection improves accuracy significantly.
3
Copy or download text
The recognised text appears in an editable area. Copy it to your clipboard or download as a .txt file for further use.

Image to Text

You might also need

How image-to-text extraction works

How to Extract Text from an Image

Upload an image

Select the language

Copy or download text

Frequently Asked Questions

Related Guides