PhyVLLM：基于物理学的视频语言模型，提升视频理解能力

Research #Video LLM 🔬 Research|分析: 2026年1月10日 13:14•

发布: 2025年12月4日 07:28

•

1分で読める

分析

这项研究介绍了PhyVLLM，这是一种通过结合物理学原理来提升视频理解的新方法，为动态场景提供了更强大和准确的表示。运动-外观解耦是一项关键创新，有助于构建更具泛化性的模型。

引用 / 来源

"PhyVLLM leverages motion-appearance disentanglement."

ArXiv2025年12月4日 07:28

* 根据版权法第32条进行合法引用。

Refaçade: AI-Powered Object Editing with Reference Textures

Advancements in Visible-Infrared Person Re-Identification through Identity Clue Refinement