微调CodeLlama-34B在HumanEval上超越GPT-4

Research #AI Code Generation 👥 Community|分析: 2026年1月3日 06:20•

发布: 2023年8月25日 22:08

•

1分で読める

分析

这篇文章报告了在专有数据集上微调CodeLlama-34B和CodeLlama-34B-Python，在HumanEval上获得了比GPT-4更高的pass@1分数。作者强调了在其数据集中使用指令-答案对、原生微调以及应用OpenAI的去污方法以确保结果有效性。训练过程涉及DeepSpeed ZeRO 3、Flash Attention 2和32个A100-80GB GPU，在三个小时内完成。这篇文章突出了代码生成能力方面的一项重大成就。

关键要点

引用 / 来源

查看原文

"We have fine-tuned CodeLlama-34B and CodeLlama-34B-Python on an internal Phind dataset that achieved 67.6% and 69.5% pass@1 on HumanEval, respectively. GPT-4 achieved 67%."

Hacker News2023年8月25日 22:08

* 根据版权法第32条进行合法引用。

较旧

Vibe Coding as Interface Flattening

较新

AI is Taking Over Your Video Recommendation Feed

微调CodeLlama-34B在HumanEval上超越GPT-4

分析

关键要点

相关分析

人类AI检测

侧重于实现的深度学习书籍

个性化 Gemini

📬 Get AI News Delivered

按类别浏览

热门话题

📬 Get AI News Delivered

按类别浏览

热门话题