Steering LLM Reasoning for Efficiency and Accuracy
Analysis
Key Takeaways
- •Proposes CREST, a training-free method for steering LLM reasoning at test time.
- •Identifies and intervenes on specific attention heads associated with cognitive behaviors like verification and backtracking.
- •Improves accuracy by up to 17.5% and reduces token usage by 37.6%.
- •Offers a pathway to faster and more reliable LLM reasoning without retraining.
“CREST improves accuracy by up to 17.5% while reducing token usage by 37.6%, offering a simple and effective pathway to faster, more reliable LLM reasoning.”