话题

code model

Aggregated news, research, and updates specifically regarding code model. Auto-curated by our AI Engine.

揭露恶意 AI 代码：基于执行轨迹的可证明方法

ArXiv•2025年12月15日 19:05•Safety▸

Safety #Code AI 🔬 Research|分析: 2026年1月10日 11:00•

发布: 2025年12月15日 19:05

•

1分で読める

•ArXiv

这份来自 ArXiv 的研究提出了一种通过分析代码世界模型的执行轨迹来检测恶意行为的方法。重点关注可证明的揭露，是对 AI 安全性的重大贡献。

要点与引用▶

引用 / 来源

"The research focuses on provably unmasking malicious behavior."

* 根据版权法第32条进行合法引用。

Loading topic feed...