Real-Time Interactive Human Avatars with Streaming Diffusion Models

Paper#AI/Computer Vision/Digital Humans🔬 Research|Analyzed: Jan 3, 2026 16:32
Published: Dec 26, 2025 15:41
1 min read
ArXiv

Analysis

This paper addresses the challenge of creating real-time, interactive human avatars, a crucial area in digital human research. It tackles the limitations of existing diffusion-based methods, which are computationally expensive and unsuitable for streaming, and the restricted scope of current interactive approaches. The proposed two-stage framework, incorporating autoregressive adaptation and acceleration, along with novel components like Reference Sink and Consistency-Aware Discriminator, aims to generate high-fidelity avatars with natural gestures and behaviors in real-time. The paper's significance lies in its potential to enable more engaging and realistic digital human interactions.
Reference / Citation
View Original
"The paper proposes a two-stage autoregressive adaptation and acceleration framework to adapt a high-fidelity human video diffusion model for real-time, interactive streaming."
A
ArXivDec 26, 2025 15:41
* Cited for critical analysis under Article 32.