Unsupervised Discovery of Reasoning Behaviors in LLMs

Paper #llm 🔬 Research|Analyzed: Jan 3, 2026 18:22•

Published: Dec 30, 2025 05:09

•

1 min read

Analysis

This paper introduces an unsupervised method (RISE) to analyze and control reasoning behaviors in large language models (LLMs). It moves beyond human-defined concepts by using sparse auto-encoders to discover interpretable reasoning vectors within the activation space. The ability to identify and manipulate these vectors allows for controlling specific reasoning behaviors, such as reflection and confidence, without retraining the model. This is significant because it provides a new approach to understanding and influencing the internal reasoning processes of LLMs, potentially leading to more controllable and reliable AI systems.

Key Takeaways

•Proposes an unsupervised framework (RISE) for discovering reasoning vectors in LLMs.
•RISE uses sparse auto-encoders to identify interpretable reasoning behaviors.
•Enables control over specific reasoning behaviors (e.g., reflection, confidence) without retraining.
•Discovers novel reasoning behaviors beyond human supervision.

Reference / Citation

View Original

"Targeted interventions on SAE-derived vectors can controllably amplify or suppress specific reasoning behaviors, altering inference trajectories without retraining."

ArXivDec 30, 2025 05:09

* Cited for critical analysis under Article 32.

Older

Drone Uses AI and 11,500 Crashes to Learn How to Fly

Newer

Do AI detectors work? Students face false cheating accusations

Related Analysis

Paper

Unsupervised Discovery of Reasoning Behaviors in LLMs

Analysis

Key Takeaways

Related Analysis

Instant 3D Scene Editing from Unposed Images

Coordinated Humanoid Manipulation with Choice Policies

LLM Forecasting for Future Prediction

📬 Get AI News Delivered

Browse by Category

Trending Topics

📬 Get AI News Delivered

Browse by Category

Trending Topics