tf-agents:tensorflow的强化学习库

tf-agents-nightl的Python项目详细描述


TF-Agents:TensorFlow中的强化学习库

注意:当前TF代理的预发布正在积极开发中,并且 接口可以随时更改请随时提供反馈和意见。

为了开始,我们建议查看我们的colab教程之一。如果你 需要一个介绍rl(或快速回顾)。 start here。否则,请查看我们的 DQN tutorial让一个特工起来 在车柱环境中跑步。

目录

Agents
Tutorials
Examples
Installation
Contributing
Principles
Citation
Disclaimer

Agents

In TF-Agents, the core elements of RL algorithms are implemented as ^{}. An agent encompasses two main responsibilities: defining a Policy to interact with the Environment, and how to learn/train that Policy from collected experience.

Currently the following algorithms are available under TF-Agents:

Tutorials

See ^{}专业教程 提供组件

Examples

End-to-end examples training agents can be found under each agent directory. e.g.:

Installation

To install the latest version, use nightly builds of TF-Agents under the pip package ^{}, which requires you install on one of ^{} and ^{} and also ^{}. Nightly builds include newer features, but may be less stable than the versioned releases.

To install the nightly build version, run the following:

^{pr 1}$

If you clone the repository you will still need a ^{} installation. You can then run ^{} from the agents directory to get dependencies to run tests.

Contributing

We're eager to collaborate with you! See ^{} 关于如何贡献的指南这个项目遵循TensorFlow的 code of conduct。通过参与,你应该 坚持这个准则

Principles

This project adheres to Google's AI principles。 通过参与、使用或参与本项目 坚持这些原则。

引文

如果您使用此代码,请将其引用为:

@misc{TFAgents,
  title = {{TF-Agents}: A library for Reinforcement Learning in TensorFlow},
  author = "{Sergio Guadarrama, Anoop Korattikara, Oscar Ramirez,
    Pablo Castro, Ethan Holly, Sam Fishman, Ke Wang, Ekaterina Gonina, Neal Wu,
    Chris Harris, Vincent Vanhoucke, Eugene Brevdo}",
  howpublished = {\url{https://github.com/tensorflow/agents}},
  url = "https://github.com/tensorflow/agents",
  year = 2018,
  note = "[Online; accessed 25-June-2019]"
}

免责声明

这不是谷歌的官方产品。

欢迎加入QQ群-->: 979659372 Python中文网_新手群

推荐PyPI第三方库


热门话题
如何使用Netbeans在Java中创建可执行文件   swing Set不透明(真/假);JAVA   java JPA无法删除具有onotomany关系和cascatype all的实体   java Hibernate自参考映射难题   Java/从字符串到Java代码   java为什么我已经在取整了,却得到了一个无终止小数扩展?   java Wicket onsubmit信息对话框   java如何使用类、类加载器和URL获取资源   java在多模块项目中使用License Maven插件时出错   java HashSet“contains”方法是如何工作的?   图像JavaPNG显示为黑色   java如何在保持内容完整的同时在两个活动之间切换?   JAVA在Tomcat 7.0.27中,WAR无法为Prod正确解包   listview的java onclicklistener   JavaXPages托管Bean可以跨多个数据库工作吗?   java无法停止服务空对象引用错误Android   java将UTF8编码的字符串转换为人类可读的字符串   JavaSpring与JBossSAML的集成   java如何替换AuthorizationCodeInstalledApp类?