基于预测3D高斯分布随时间变化的视频跟踪

Paper #Computer Vision 🔬 Research|分析: 2026年1月3日 16:27•

发布: 2025年12月27日 06:16

•

1分で読める

分析

本文提出了一种新颖的自监督方法Video-GMAE，用于视频表示学习。其核心思想是将视频表示为一组随时间移动的3D高斯散点。这种归纳偏置使得模型能够学习有意义的表示，并实现令人印象深刻的零样本跟踪性能。在Kinetics和Kubric数据集上的显著性能提升突出了所提出方法的有效性。

引用 / 来源

"Mapping the trajectory of the learnt Gaussians onto the image plane gives zero-shot tracking performance comparable to state-of-the-art."

ArXiv2025年12月27日 06:16

* 根据版权法第32条进行合法引用。

ManchuTTS: Towards High-Quality Manchu Speech Synthesis via Flow Matching and Hierarchical Text Representation

Show HN: Hyperbrowser MCP Server – Connect AI agents to the web through browsers