The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models

Research #llm | Analyzed: January 4, 2026 07:26

Published: November 25, 2025 17:16


Analysis

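The paper introduces a new benchmark, the Text Aphasia Battery (TAB), designed to evaluate aphasia-like deficits in language models. Its clinical grounding suggests a rigorous approach to assessing both the linguistic capabilities and the limitations of these models. Using aphasia as a model for the limitations of AI systems is an interesting choice.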

Citation / Source

"The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models"

ArXiv, November 25, 2025 17:16

* Quoted lawfully under Article 32 of the Copyright Act.
