Free & Open Source · OCR Overlay · v0.9.3

Read Japanese Games.
Look Up Every Word.
Stay in the Zone.

YomiNinja captures on-screen Japanese text from games, visual novels, emulators, manga readers, and subtitled video, overlays it directly onto the window, and connects it to Yomitan's popup dictionary — so you can hover any kanji and get instant definitions without ever alt-tabbing.

Windows Linux macOS

An OCR overlay built for Japanese immersion

YomiNinja reads on-screen Japanese text using optical character recognition, renders an interactive overlay directly on top of your game or application, and connects it to Yomitan's popup dictionary engine so you can hover any word, see the reading, and keep playing.

It works on any application where Japanese text is visible on screen — JRPGs, visual novels, emulators, manga readers, and video players with hardcoded subtitles. No game-engine hook list. No process injection. No need to move text into a browser first.

See how it works →
v0.9.3
Latest Public Release
3
Platforms
5
OCR Engines
GPL
3.0 License

Three steps. Zero interruptions.

From pressing a hotkey to reading a definition in under a second.

  1. Select your capture window

    Point YomiNinja at a game, visual novel, or any application window. Draw an OCR template over the dialogue box to target just the text region — or let Auto OCR scan the full window automatically.

  2. OCR reads the text, overlay appears

    Press the hotkey — or let Auto OCR fire when it detects new on-screen content. YomiNinja processes the capture through your chosen OCR engine and renders the extracted Japanese text as a transparent, interactive overlay on top of your game.

  3. Hover any word — Yomitan pops up

    Move your cursor over any character in the overlay. Yomitan — pre-installed, no browser setup required — triggers instantly with readings, definitions, pitch accent, and frequency data. Your immersion stays unbroken.

Everything in the box

The core app, dictionary lookups, OCR engine choice, and mining workflow all live in one stack.

Works on any game

No text hooking, no engine compatibility lists. If Japanese text is visible on screen, YomiNinja reads it — including games, emulators, visual novels with hardcoded or image-rendered text, manga readers, and video players.

Compatibility →

Yomitan & 10ten built in

Both dictionary extensions ship pre-installed. Import JMdict once, then start hovering kanji immediately. No browser, no Chrome Web Store, no separate extension install.

Dictionary setup →

Five OCR engines

PaddleOCR for most games, MangaOCR for stylized fonts, Google Cloud Vision and Google Lens for hard cases, and Apple Vision Framework on macOS for strong native OCR including vertical text.

Compare engines →

Auto OCR — hands-free

Continuous text detection that fires automatically when screen content changes. No hotkey required mid-battle. Define OCR templates over dialogue boxes to reduce OCR noise and improve speed.

All features →

VOICEVOX TTS

Hear extracted text read aloud using VOICEVOX, a high-quality Japanese speech engine. Built in as of v0.9.1 across all platforms. Listen while you read — reinforce pronunciation naturally.

Audio features →

Anki mining pipeline

WebSocket output streams recognized text to any texthooker page in real time. Pipe sentences into your Anki sentence mining workflow without leaving the game. Direct Anki integration is still planned, so the current workflow uses WebSocket plus Yomitan.

Mining guide →

Three kinds of immersion learners

The game explorer

Playing untranslated JRPGs, action games, and everything that never got a localization. YomiNinja turns Japanese games from inaccessible to learnable — one dialogue box at a time.

  • PaddleOCR or Google Lens
  • Yomitan + JLPT frequency list
  • Auto OCR for continuous play

The VN & manga reader

Working through visual novels and manga where text hooks don't reach. Hardcoded subtitles, image-rendered text, web-based readers — OCR captures what Textractor cannot.

  • MangaOCR for stylized fonts
  • OCR templates for text regions
  • 10ten Reader for quick lookups

The vocabulary miner

Building Anki decks from native content, one sentence at a time. WebSocket output feeds directly into texthooker pages. Every sentence you read becomes a potential flashcard.

  • WebSocket → Texthooker page
  • Yomitan → Anki export
  • VOICEVOX for audio cards

Windows, Linux & macOS

Windows

Windows 10 & 11. Full feature set including all OCR engines, VOICEVOX TTS, and GPU acceleration.

  • VCRedist required
  • Media Feature Pack for N/KN editions
  • CUDA support for GPU-accelerated OCR
Download .exe

Linux

AppImage, .deb, .rpm, and .pacman packages. X11 required — Wayland experimental support added in v0.9.1.

  • xdotool required
  • X11 (Wayland: experimental)
  • ROCm support for AMD GPU
Download AppImage

macOS

Intel x64 and Apple Silicon (M1/M2/M3). Apple Vision Framework for native OCR including vertical Japanese text.

  • ARM64: run xattr -cr YomiNinja.app
  • Reset Accessibility & Screen Recording on update
  • Apple Vision varies by macOS version
Download .dmg

Why learners use it instead of piecing together other tools

For games and emulators: OCR works where text hooks fail, especially on JRPGs, console ports, emulator windows, and titles with no usable hook.

Based on the public repo, issue tracker, and Japanese-learning community discussions

For actual study: the main workflow is OCR overlay plus Yomitan popup, with optional WebSocket output for sentence mining and VOICEVOX for audio reinforcement.

Aligned with the public feature set and setup discussions around YomiNinja

Download YomiNinja Free

Open source. No account. No subscription. No telemetry.
Windows, Linux, macOS — with clear setup paths for Yomitan, OCR engine choice, and first lookup.

v0.9.3 · GPL-3.0 · GitHub