QianfanHuijin: Multi-Stage Training for Financial LLMs

Published: Dec 30, 2025 16:10
ArXiv

Analysis

This paper introduces QianfanHuijin, a financial-domain LLM, together with a multi-stage training paradigm for building it. The work targets LLMs that combine domain knowledge with advanced reasoning and agentic capabilities, rather than stopping at knowledge enhancement. The paradigm includes Continual Pre-training, Financial SFT, Reasoning RL, and Agentic RL, and this staged design is the paper's main contribution. Its focus on real-world business scenarios, together with validation through benchmarks and ablation studies, suggests a practical approach to industrial LLM development.
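To make the staged structure concrete, the following is a minimal sketch of how such a pipeline might be wired together. It is not the paper's implementation. The stage names come from the summary above, but running them in the listed order is an assumption, and `Checkpoint`, `make_stage`, `run_pipeline`, and `ablate` are hypothetical names introduced for illustration only. Each stage is a placeholder that records its name instead of training a model.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Checkpoint:
    """Hypothetical stand-in for model weights; it only records which stages ran."""
    history: List[str] = field(default_factory=list)


# Stage names taken from the summary; the ordering is an assumption.
STAGES = [
    "continual_pretraining",
    "financial_sft",
    "reasoning_rl",
    "agentic_rl",
]


def make_stage(name: str) -> Callable[[Checkpoint], Checkpoint]:
    """Build a placeholder stage that appends its name to the checkpoint history."""
    def stage(ckpt: Checkpoint) -> Checkpoint:
        # A real stage would train on stage-specific data here.
        return Checkpoint(history=ckpt.history + [name])
    return stage


def run_pipeline(base: Checkpoint, stages: List[str]) -> Checkpoint:
    """Apply each stage in order, feeding every stage the previous checkpoint."""
    ckpt = base
    for name in stages:
        ckpt = make_stage(name)(ckpt)
    return ckpt


def ablate(stages: List[str], dropped: str) -> List[str]:
    """Return the stage list with one stage removed, as in an ablation run."""
    return [s for s in stages if s != dropped]


if __name__ == "__main__":
    full = run_pipeline(Checkpoint(), STAGES)
    no_agentic = run_pipeline(Checkpoint(), ablate(STAGES, "agentic_rl"))
    print(full.history)
    print(no_agentic.history)
```

Comparing the full run against a run with one stage dropped mirrors the general shape of an ablation study: the difference in downstream scores would be attributed to the removed stage.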

Reference

The paper highlights that the targeted Reasoning RL and Agentic RL stages yield significant gains in their respective capabilities.