Qwen3.6 GGUF Performance Benchmarks and Updates
Analysis
The article presents detailed performance benchmarks for Qwen3.6-35B-A3B in GGUF format. It also addresses a common misunderstanding about frequent re-uploads, which are mostly caused by external factors such as llama.cpp bug fixes and CUDA issues.
Key Points
- Unsloth quants lead on the KLD (KL divergence) versus disk space trade-off for Qwen3.6-35B-A3B in GGUF format.
- Frequent re-uploads are often due to external factors such as bug fixes or CUDA issues, not mistakes by the quant provider.
- CUDA 13.2 is confirmed to be broken, producing gibberish output in low-bit quants on all models; the temporary workaround is to use CUDA 13.1.
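For readers unfamiliar with the KLD metric named in the key points, the following is a minimal sketch of how a mean KL divergence between a reference model and a quantized model could be computed from per-token logits. It uses the standard textbook definition and is not necessarily the exact procedure behind the article's benchmarks; the function names (`softmax`, `kl_divergence`, `mean_kld`) and the toy logits are illustrative assumptions.

```python
import math

def softmax(logits):
    """Convert raw logits to a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats; p is the reference, q is the quantized model."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

def mean_kld(ref_logits, quant_logits):
    """Average KL divergence over token positions (hypothetical helper)."""
    klds = [
        kl_divergence(softmax(r), softmax(q))
        for r, q in zip(ref_logits, quant_logits)
    ]
    return sum(klds) / len(klds)

# Toy example: two token positions, vocabulary of three tokens (made-up values).
reference = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]]
quantized = [[1.9, 1.1, 0.1], [0.6, 2.4, 0.4]]
print(f"mean KLD: {mean_kld(reference, quantized):.6f}")
```

A lower mean KLD means the quantized model's output distribution stays closer to the reference, so a quant with low KLD at a small file size sits at the favorable end of the trade-off.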
Quotes and Sources
"In roughly 95% of cases, the root causes were out of our hands - we just try to be transparent and keep the community informed."