Qwen3.5 LLM Performance Soars on Strix Halo: Exciting Unsloth Quantization Insights!

research #llm | Blog | Analyzed: Mar 9, 2026 21:16

Published: Mar 9, 2026 20:18

Analysis

This post examines the performance of the Qwen3.5-35B and 122B models quantized with Unsloth's new 'dynamic' (UD) method on a Strix Halo system. The tests compare Unsloth's UD XL quants (122B-A10B-UD-Q5_K_XL and 35B-A3B-UD-Q6_K_XL) against Bartowski's quants, reporting performance gains as well as odd behavior from the dynamic quants while generating a complex HTML file with a 3D animated solar system.

Key Takeaways

• The tests measure the performance of Qwen3.5 LLMs (35B and 122B parameters) under new quantization methods.
• Unsloth's 'dynamic' quants are compared with Bartowski's quants for both performance and stability.
• The evaluation includes generating a complex HTML file, which exposes performance differences in a practical task.

Reference / Citation

"Besides of the numbers in performance, i noticed while testing somethnig odd with "dynamic" quants, i tested already two of them on strix halo, 122B-A10B-UD-Q5_K_XL and 35B-A3B-UD-Q6_K_XL and they behave weird."

r/LocalLLaMA, Mar 9, 2026 20:18

* Cited for critical analysis under Article 32.
