CIFE：代码指令跟随评估的新基准

Research #LLM 🔬 Research|分析: 2026年1月10日 09:40•

发布: 2025年12月19日 09:43

•

1分で読める

分析

本文介绍了CIFE，这是一个新的基准，旨在评估语言模型遵循代码指令的程度。这项工作解决了对LLM在代码相关任务中进行更稳健评估的关键需求。

引用 / 来源

"CIFE is a benchmark for evaluating code instruction-following."

ArXiv2025年12月19日 09:43

* 根据版权法第32条进行合法引用。

Can Vision-Language Models Understand Cross-Cultural Perspectives?

Real-time Information Updates for Mobile Devices: A Comparative Study