rllib

Apache Arrow has a first-class tabular file format, Feather, that the Ray Datasets IO layer should support. Combined with Ray Datasets' existing .from_arrow() and .to_arrow() APIs, this would round out our "all-Arrow" experience, which should be as nice as possible given our "distributed Arrow dataset" positioning.

Implementation Note

Currently we use a very ad-hoc procedure for scaling the quadratic component of NAF when used for exploration:
https://github.com/angelolovatto/raylab/blob/9820275b17ee085e1955a6d845c0bdf61333f8da/raylab/algorithms/naf/naf_policy.py#L150-L155

A possibly better alternative would be to scale it based on the desired average action stddev. Something like:

scale_tril * (1.0 / average_st
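The snippet above is cut off in the source, so the intended expression is not recoverable. As a rough sketch of the idea only, and not the raylab implementation, the following assumes that `scale_tril` is the lower-triangular Cholesky factor of the action covariance, so each action dimension's standard deviation is the Euclidean norm of the matching row. The helper names `per_dim_stddev` and `rescale_to_average_stddev`, and the `eps` guard, are hypothetical and chosen for illustration.

```python
import math


def per_dim_stddev(scale_tril):
    """Per-dimension stddev, assuming covariance = L @ L.T for L = scale_tril.

    The i-th diagonal entry of L @ L.T is the squared norm of row i of L.
    """
    return [math.sqrt(sum(x * x for x in row)) for row in scale_tril]


def rescale_to_average_stddev(scale_tril, desired_avg_stddev, eps=1e-8):
    """Scale L so that the mean of the per-dimension stddevs matches the target.

    Hypothetical helper: multiplying L by a scalar c multiplies every
    per-dimension stddev by c, so c = desired / current average suffices.
    """
    stddevs = per_dim_stddev(scale_tril)
    average_stddev = sum(stddevs) / len(stddevs)
    factor = desired_avg_stddev / max(average_stddev, eps)  # eps avoids division by zero
    return [[x * factor for x in row] for row in scale_tril]


# Example: a 2-D lower-triangular factor whose row norms are 3 and 5.
example_tril = [[3.0, 0.0], [4.0, 3.0]]
rescaled = rescale_to_average_stddev(example_tril, desired_avg_stddev=0.2)
new_average = sum(per_dim_stddev(rescaled)) / len(rescaled)
```

Because the scaling is a single scalar, the correlation structure encoded in `scale_tril` is unchanged; only the overall exploration magnitude moves to the requested level.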



Here are 36 public repositories matching this topic...

ray-project / ray

[Datasets] Add Feather IO layer to Ray Datasets.


[tune] Clarify documentation around using different resource requirements across trials

[RLlib] Policy weights overwritten in self-play

utiasDSL / gym-pybullet-drones

Draichi / T-1000

druce / rl

ChuaCheowHuan / gym-continuousDoubleAuction

DerwenAI / ray_tutorial

angelolovatto / raylab

Scale tril by desired average action stddev

DerwenAI / rllib_tutorials

JacopoPan / a-minimalist-guide

goshaQ / adaptive-tls

DerwenAI / gym_example

dcos-labs / dcos-jupyterlab-service

CN-UPB / DeepCoMP

AhmetFurkanDEMIR / SuperMarioBrosRL

akirasosa / aie-train

HumanCompatibleAI / better-adversarial-defenses

nicofirst1 / rl_werewolf

Senmumu / ray_project_doc

toanngosy / robustprosthetics

ChuaCheowHuan / PBT_MARL_watered_down

rlew631 / AutonomousVehicleSimulation

wullli / flatlander

xdralex / pioneer

mynkpl1998 / upgraded-octo-lamp

ChuaCheowHuan / sagemaker_Ray_RLlib_custom_env

3neutronstar / flow-autonomous-driving

jthelin / HelloRayActors

hybug / RL_Lab

LeonZamel / coins-and-traitors

thiagopbueno / model-aware-policy-optimization

Add Gym env for Navigation domain with bimodal dynamics distribution

Create Value function class
