Fix for Nvidia Nemotron Nano 3's forced thinking – now it can be toggled on and off!
Published:Dec 28, 2025 15:51
•1 min read
•r/LocalLLaMA
Analysis
The article discusses a bug fix for Nvidia's Nemotron Nano 3 LLM, specifically addressing the issue of forced thinking. The original instruction to disable detailed thinking was not working due to a bug in the Lmstudio Jinja template. The workaround involves a modified template that enables thinking by default but allows users to toggle it off using the '/nothink' command in the system prompt, similar to Qwen. This fix provides users with greater control over the model's behavior and addresses a usability issue. The post includes a link to a Pastebin with the bug fix.
Key Takeaways
Reference
“The instruction 'detailed thinking off' doesn't work...this template has a bugfix which makes thinking on by default, but it can be toggled off by typing /nothink at the system prompt (like you do with Qwen).”