📷 OCR — Image / PDF to Text
Tesseract 5 with the trained Japanese model. Extract text from PNG / JPEG / WebP / GIF or PDF (max 50 MB, 50 pages per PDF).
100% Free
No signup
Server-side
No logs / DB
Rate-limited
VPS high-accuracy
OSS-based
5 languages
🔒 About Privacy
- ・Uploaded files are passed to Tesseract then immediately deleted (a few seconds in /tmp at most).
- ・No logs of OCR text, file names, or sizes are kept.
- ・Rate limit: 30 requests per IP per minute.
Drag & drop here, or click to choose
PNG / JPEG / WebP / GIF / PDF · ≤50MB
📖 How to Use
-
1
Choose fileDrag & drop an image (PNG / JPEG / WebP / GIF) or PDF (max 50 MB).
-
2
Pick languagePick Japanese + English (recommended), Japanese only, or English only.
-
3
Run → copy or downloadClick Run OCR. Copy the result or download as .txt.
❓ FAQ
How accurate is it?
Uses Tesseract 5 with the official Japanese trained model. Clean print (books, PDFs, scans) achieves 90%+; handwriting, complex backgrounds, and low resolution degrade accuracy.
PDF page limit?
PDF: pages 1–20. Ghostscript rasterizes each page to 300 dpi grayscale PNG, then OCR runs per page.
Are uploaded files stored?
No. Files live in a temp directory for a few seconds and are deleted right after Tesseract finishes. No logs of OCR text, file names, or sizes are kept.
🔗 Related tools
🐛 Found a bug or issue with this tool?
Free to use, no signup. Even just the steps to reproduce are helpful. Reports go directly to the operator and help us fix issues.
✅
Thanks for your report!
Your report has been delivered to the operator and will be used to improve the tool.