Skip to content

📷 OCR — Image / PDF to Text

Tesseract 5 with the trained Japanese model. Extract text from PNG / JPEG / WebP / GIF or PDF (max 50 MB, 50 pages per PDF).

100% Free No signup Server-side No logs / DB Rate-limited VPS high-accuracy OSS-based 5 languages

🔒 About Privacy

📂
Drag & drop here, or click to choose
PNG / JPEG / WebP / GIF / PDF · ≤50MB

📖 How to Use

  1. 1
    Choose file
    Drag & drop an image (PNG / JPEG / WebP / GIF) or PDF (max 50 MB).
  2. 2
    Pick language
    Pick Japanese + English (recommended), Japanese only, or English only.
  3. 3
    Run → copy or download
    Click Run OCR. Copy the result or download as .txt.

❓ FAQ

How accurate is it?
Uses Tesseract 5 with the official Japanese trained model. Clean print (books, PDFs, scans) achieves 90%+; handwriting, complex backgrounds, and low resolution degrade accuracy.
PDF page limit?
PDF: pages 1–20. Ghostscript rasterizes each page to 300 dpi grayscale PNG, then OCR runs per page.
Are uploaded files stored?
No. Files live in a temp directory for a few seconds and are deleted right after Tesseract finishes. No logs of OCR text, file names, or sizes are kept.

🔗 Related tools

🐛 Found a bug or issue with this tool?

Free to use, no signup. Even just the steps to reproduce are helpful. Reports go directly to the operator and help us fix issues.

* Browser info (UA / screen / language / URL) is sent automatically to help reproduce the issue