rllib
Here are 36 public repositories matching this topic...
-
Updated
Jul 24, 2021 - Python
-
Updated
Oct 23, 2020 - Python
-
Updated
May 21, 2021 - Jupyter Notebook
-
Updated
Jun 8, 2021 - Jupyter Notebook
-
Updated
Jul 5, 2021 - Jupyter Notebook
Currently we use a very ad-hoc procedure for scaling the quadratic component of NAF when used for exploration:
https://github.com/angelolovatto/raylab/blob/9820275b17ee085e1955a6d845c0bdf61333f8da/raylab/algorithms/naf/naf_policy.py#L150-L155
A possibly better alternative would be to scale it based on the desired average action stddev. Something like:
scale_tril * (1.0 / average_st
-
Updated
May 29, 2021 - Jupyter Notebook
-
Updated
Nov 17, 2020 - Python
-
Updated
Jul 7, 2021 - Python
-
Updated
Jul 28, 2021 - Python
-
Updated
May 22, 2021 - Python
-
Updated
Dec 8, 2020 - Python
-
Updated
Jun 8, 2021 - Python
-
Updated
Sep 6, 2019 - Python
-
Updated
Jun 12, 2019 - Python
-
Updated
Aug 25, 2020 - Jupyter Notebook
-
Updated
Dec 15, 2020 - Jupyter Notebook
-
Updated
Nov 27, 2020 - Jupyter Notebook
-
Updated
Aug 11, 2020 - Python
-
Updated
Mar 25, 2021 - Jupyter Notebook
-
Updated
Sep 30, 2019 - Python
-
Updated
May 26, 2019 - Python
-
Updated
Jul 12, 2021 - Python
-
Updated
May 24, 2021 - Python
Implement parameterized linear and non-linear dynamics model with bimodal Gaussian noise.
Improve this page
Add a description, image, and links to the rllib topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rllib topic, visit your repo's landing page and select "manage topics."
Apache Arrow has a first-class tabular file format, Feather, that the Ray Datasets IO layer should support. Combined with Ray Datasets' existing
.from_arrow()
and.to_arrow()
APIs, this would round out our "all-Arrow" experience, which should be as nice as possible given our "distributed Arrow dataset" positioning.Implementation Note