SenseNova-MARS: Open-Source AI Agent Surpasses Gemini-3-Pro in Multimodal Performance!
Analysis
商汤's SenseNova-MARS has emerged as a groundbreaking open-source model, outperforming even leading closed-source models like Gemini-3-Pro in multimodal search and reasoning. This achievement marks a significant leap in AI capabilities, especially in complex tasks requiring visual understanding and tool usage. The open-source nature of SenseNova-MARS promises to accelerate innovation and collaboration within the AI community.
Key Takeaways
- •SenseNova-MARS is an open-source multimodal AI Agent.
- •It surpasses Gemini-3-Pro and GPT-5.2 in several benchmarks.
- •The model excels at complex tasks that require dynamic visual reasoning and tool usage.
Reference / Citation
View Original"SenseNova-MARS is the first Agentic VLM model to support dynamic visual reasoning and deep integration of image and text search."
雷
雷锋网Jan 30, 2026 03:18
* Cited for critical analysis under Article 32.