近端策略优化

发布:2017年7月20日 07:00

•

1分で読める

分析

这篇文章宣布了 OpenAI 发布一种新的强化学习算法，近端策略优化 (PPO)。关键卖点是其与现有方法相当或更优的性能、易于实现以及易于调整。文章强调 PPO 现在是 OpenAI 的默认强化学习算法。

引用

“PPO 已经成为 OpenAI 的默认强化学习算法，因为它易于使用且性能良好。”

MedKGI: Iterative Differential Diagnosis with Medical Knowledge Graphs and Information-Guided Inquiring

CS 522: Machine Learning Approaches to Decode the Human Genome