The Wayback Machine - https://web.archive.org/web/20220323190901/https://github.com/topics/reinforcement-learning
Skip to content
#

reinforcement-learning

Here are 7,842 public repositories matching this topic...

DmitriGekhtman
DmitriGekhtman commented Mar 14, 2022

Search before asking

  • I searched the issues and found no similar issues.

Ray Component

Ray Clusters

Issue Severity

Medium: It contributes to significant difficulty to complete my task but I work arounds and get it resolved.

What happened + What you expected to happen

The arguments to ray.autoscaler.sdk are not validated.
I

bug good first issue P1
annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
  • Updated Mar 21, 2022
  • Jupyter Notebook
stable-baselines
calerc
calerc commented Nov 23, 2020

The following applies to DDPG and TD3, and possibly other models. The following libraries were installed in a virtual environment:

numpy==1.16.4
stable-baselines==2.10.0
gym==0.14.0
tensorflow==1.14.0

Episode rewards do not seem to be updated in model.learn() before callback.on_step(). Depending on which callback.locals variable is used, this means that:

  • episode rewards may n
good first issue question

Improve this page

Add a description, image, and links to the reinforcement-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reinforcement-learning topic, visit your repo's landing page and select "manage topics."

Learn more