Skip to content
Image Tools

Image to Text (OCR)

Extract text from images, screenshots, and scanned documents with OCR. Supports 100+ languages via Tesseract.js. Free to use in the browser, and signup is not required to start.

Free in browser No sign-up required Files stay on your device
Upload Image

Drop images here or click to upload

JPEG, PNG, WebP, BMP supported — multiple images OK

Result

Extracted text appears here

Upload an image and click Extract Text

Files processed locally — never uploaded
How it works

Run this tool in three short steps.

01

Upload your image

Drop or select one or more images (JPEG, PNG, WebP, BMP). Multiple images are processed in sequence.

02

Select language

Choose from 20+ languages including Chinese, Japanese, Arabic, and Hindi. Tesseract.js supports 100+ languages.

03

Extract and copy text

Click Extract Text and the OCR engine runs locally via WebAssembly. Copy the result as plain text or Markdown, or download as a .txt file.

Questions

What people ask before they use this tool.

How does image-to-text work?
We use Tesseract.js, an open-source OCR (Optical Character Recognition) engine that runs entirely in your browser. It analyzes the image pixels to recognize letters, words, and paragraphs.
Which languages are supported?
Choose from 20+ popular languages in the dropdown, including Chinese, Japanese, Korean, Arabic, Hindi, and major European languages. Tesseract.js supports 100+ languages total. Language data is downloaded on first use (~2-15MB per language).
Is my image uploaded to a server?
No. OCR processing happens locally in your browser using WebAssembly. Images are not uploaded to our servers during extraction.
What image formats work?
JPEG, PNG, WebP, BMP, and TIFF. OCR usually works better with high-contrast images and clear text. Around 300 DPI is a practical target for scanned documents.
How accurate is it?
Accuracy depends on image quality. Clean, high-resolution images with standard fonts typically achieve 95%+ accuracy. Handwriting, low resolution, or unusual fonts may reduce accuracy.
Can I extract text from handwriting?
Tesseract.js is trained primarily on printed text. Handwriting recognition is limited and results are often poor — especially cursive. For handwriting, a dedicated service like Google Vision API will perform significantly better.
Can I process multiple images at once?
Yes. You can upload multiple images in one go. The tool processes them in sequence and outputs all extracted text in a single result, with each image's text clearly labeled. You can then copy the combined text or download it as a .txt file.
What happens when I change the language?
The first time you select a new language, Tesseract.js downloads the corresponding language data file (typically 2-15MB per language, cached after the first use). English is always pre-loaded and requires no extra download. Processing time varies by language complexity.
Why does the first OCR run take longer?
The OCR engine and selected language data are downloaded on first use and cached in your browser. Subsequent runs in the same session are significantly faster. The progress bar and status text show what is happening during initialization.
Does OCR work on mobile phones?
Yes. Tesseract.js runs via WebAssembly in all modern mobile browsers. You can photograph a document with your phone camera and extract text immediately. Processing is slower than desktop but fully functional.
How does this compare to Google Lens or Adobe Scan?
Google Lens and Adobe Scan typically use cloud-based processing. Our OCR runs in the browser, so image files are not uploaded to our servers during extraction. That browser-local workflow can be useful for sensitive documents such as contracts, medical records, or financial statements.
Can I extract text from a PDF?
This tool handles images only. For PDF text extraction, use our <a href="/pdf-to-word">PDF to Word converter</a>. If your PDF contains scanned pages (image-based), export each page as an image first, then run OCR here.
What image resolution gives stronger OCR accuracy?
Aim for around 300 DPI for scanned documents. Higher resolution can help, especially for small text. If your scan is low-res, try upscaling it first with our <a href="/image-upscaler">AI Image Upscaler</a> before running OCR.
Can I extract text from screenshots?
Yes. Screenshots of websites, chat messages, code editors, and apps work well because they contain clean, high-contrast text. The tool handles multi-column layouts and mixed content.
Related

Continue the workflow

Coda One's OCR tool extracts text from images using Tesseract.js running in your browser via WebAssembly. It supports 100+ languages and works with screenshots, document photos, and scanned pages. Image files are processed locally during extraction and are not uploaded to our servers.

More:  All Image Tools  · Compress  · Convert  · Resize  · Remove BG