Search:
Match:
1 results

Frontend Tools for Viewing Top Token Probabilities

Published:Jan 3, 2026 00:11
1 min read
r/LocalLLaMA

Analysis

The article discusses the need for frontends that display top token probabilities, specifically for correcting OCR errors in Japanese artwork using a Qwen3 vl 8b model. The user is looking for alternatives to mikupad and sillytavern, and also explores the possibility of extensions for popular frontends like OpenWebUI. The core issue is the need to access and potentially correct the model's top token predictions to improve accuracy.
Reference

I'm using Qwen3 vl 8b with llama.cpp to OCR text from japanese artwork, it's the most accurate model for this that i've tried, but it still sometimes gets a character wrong or omits it entirely. I'm sure the correct prediction is somewhere in the top tokens, so if i had access to them i could easily correct my outputs.