Qwen3.5 LLM Performance Soars on Strix Halo: Exciting Unsloth Quantization Insights!
Analysis
This research explores the performance of the Qwen3.5-35B and 122B models, using new 'dynamic' quantization methods on a Strix Halo system. The tests compare the Unsloth UDXL quants against Bartowski's implementation, revealing performance gains and intriguing behaviors during the creation of a complex HTML file with a 3D animated solar system.
Key Takeaways
- •Tests evaluate the performance of Qwen3.5 LLMs (35B and 122B parameters) using new quantization methods.
- •Unsloth's 'dynamic' quantization is compared to Bartowski's method to identify performance improvements and stability.
- •The evaluation includes generating a complex HTML file, highlighting performance differences in practical applications.
Reference / Citation
View Original"Besides of the numbers in performance, i noticed while testing somethnig odd with "dynamic" quants, i tested already two of them on strix halo, 122B-A10B-UD-Q5_K_XL and 35B-A3B-UD-Q6_K_XL and they behave weird."