DataGovBench：用于评估数据治理工作流程中LLM Agents的新基准

Research #LLM Agents 🔬 Research|分析: 2026年1月10日 13:15•

发布: 2025年12月4日 03:25

•

1分で読める

分析

这篇ArXiv文章介绍了DataGovBench，这是一个新颖的基准，旨在评估大型语言模型（LLM）代理在实际数据治理工作流程中的性能。创建这样一个基准对于推动进步和确保LLM在此重要领域的可靠应用至关重要。

引用 / 来源

"DataGovBench is a benchmark for evaluating LLM agents for real-world data governance workflows."

ArXiv2025年12月4日 03:25

* 根据版权法第32条进行合法引用。

AI-Powered Gait Analysis for Parkinson's Disease: Leveraging RGB-D and LLMs

6G Networks Evolve: Semantic-Aware AI at the Edge