Chroma 1.0: Revolutionizing Spoken Dialogue with Real-Time Personalization!
Analysis
Key Takeaways
- •Chroma 1.0 is a real-time, open-source spoken dialogue model with personalized voice cloning.
- •It achieves sub-second latency and maintains high-quality voice synthesis.
- •The model shows a 10.96% relative improvement in speaker similarity compared to the human baseline!
“Chroma achieves sub-second end-to-end latency through an interleaved text-audio token schedule (1:2) that supports streaming generation, while maintaining high-quality personalized voice synthesis across multi-turn conversations.”