Search:
Match:
3 results
Paper#llm🔬 ResearchAnalyzed: Jan 3, 2026 19:20

Improving LLM Pruning Generalization with Function-Aware Grouping

Published:Dec 28, 2025 17:26
1 min read
ArXiv

Analysis

This paper addresses the challenge of limited generalization in post-training structured pruning of Large Language Models (LLMs). It proposes a novel framework, Function-Aware Neuron Grouping (FANG), to mitigate calibration bias and improve downstream task accuracy. The core idea is to group neurons based on their functional roles and prune them independently, giving higher weight to tokens correlated with the group's function. The adaptive sparsity allocation based on functional complexity is also a key contribution. The results demonstrate improved performance compared to existing methods, making this a valuable contribution to the field of LLM compression.
Reference

FANG outperforms FLAP and OBC by 1.5%--8.5% in average accuracy under 30% and 40% sparsity.

Analysis

This article announces the release of a new AI inference server, the "Super A800I V7," by Softone Huaray, a company formed from Softone Dynamics' acquisition of Tsinghua Tongfang Computer's business. The server is built on Huawei's Ascend full-stack AI hardware and software, and is deeply optimized, offering a mature toolchain and standardized deployment solutions. The key highlight is the server's reliance on Huawei's Kirin CPU and Ascend AI inference cards, emphasizing Huawei's push for self-reliance in AI technology. This development signifies China's continued efforts to build its own independent AI ecosystem, reducing reliance on foreign technology. The article lacks specific performance benchmarks or detailed technical specifications, making it difficult to assess the server's competitiveness against existing solutions.
Reference

"The server is based on Ascend full-stack AI hardware and software, and is deeply optimized, offering a mature toolchain and standardized deployment solutions."

Analysis

This article from 36Kr provides a concise overview of recent developments in the Chinese tech and business landscape. It covers a range of topics, including corporate compensation strategies (JD.com's bonus plan), advancements in AI applications (Meituan's "Rest Assured Beauty" and Qianwen App's user growth), industrial standardization (Tenfang Ronghai Pear Education's inclusion in the MIIT AI Standards Committee), supply chain infrastructure (SHEIN's industrial park), automotive technology (BYD's collaboration with Volcano Engine), and strategic partnerships in the battery industry (Zhongwei and Sunwoda). The article also touches upon investment activities with the mention of "Fen Yin Ta Technology" securing A round funding. The breadth of coverage makes it a useful snapshot of the current trends and key players in the Chinese tech sector.
Reference

According to Xsignal data, Qianwen App's monthly active users (MAU) exceeded 40 million in just 30 days of public testing.