QianfanHuijin: Multi-Stage Training for Financial LLMs
Published:Dec 30, 2025 16:10
•1 min read
•ArXiv
Analysis
This paper introduces QianfanHuijin, a financial domain LLM, and a novel multi-stage training paradigm. It addresses the need for LLMs with both domain knowledge and advanced reasoning/agentic capabilities, moving beyond simple knowledge enhancement. The multi-stage approach, including Continual Pre-training, Financial SFT, Reasoning RL, and Agentic RL, is a significant contribution. The paper's focus on real-world business scenarios and the validation through benchmarks and ablation studies suggest a practical and impactful approach to industrial LLM development.
Key Takeaways
- •Introduces QianfanHuijin, a financial domain LLM.
- •Proposes a multi-stage training paradigm for industrial LLM enhancement.
- •Employs Continual Pre-training, Financial SFT, Reasoning RL, and Agentic RL.
- •Demonstrates superior performance on financial benchmarks.
- •Ablation studies validate the effectiveness of Reasoning and Agentic RL stages.
Reference
“The paper highlights that the targeted Reasoning RL and Agentic RL stages yield significant gains in their respective capabilities.”