ATLAS: A High-Difficulty, Multidisciplinary Benchmark Challenging Frontier Scientific Reasoning

Research #Benchmark 🔬 Research | Analyzed: January 10, 2026, 14:38

Published: November 18, 2025, 11:13


Analysis

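The release of ATLAS marks an important step in evaluating AI capabilities across complex, multidisciplinary scientific domains. The benchmark focuses on high-difficulty reasoning and pushes the limits of current AI models.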

Quote / Source

"ATLAS is a high-difficulty, multidisciplinary benchmark for frontier scientific reasoning."

arXiv, November 18, 2025, 11:13

* Quoted lawfully under Article 32 of the Copyright Act.
