Together AI Delivers Top Speeds for DeepSeek-R1-0528 Inference on NVIDIA Blackwell
Analysis
The article highlights Together AI's achievement in optimizing inference speed for the DeepSeek-R1 model on NVIDIA's Blackwell platform. It emphasizes the platform's speed and capability for running open-source reasoning models at scale. The focus is on performance and the use of specific hardware (NVIDIA HGX B200).
Key Takeaways
- •Together AI has optimized inference for DeepSeek-R1.
- •The optimization is for NVIDIA Blackwell (HGX B200).
- •The platform is positioned as fast and capable for open-source reasoning models.
Reference
“Together AI inference is now among the world’s fastest, most capable platforms for running open-source reasoning models like DeepSeek-R1 at scale, thanks to our new inference engine designed for NVIDIA HGX B200.”