Clean PyTorch implementations of imitation and reward learning algorithms
-
Updated
Jan 7, 2025 - Python
Clean PyTorch implementations of imitation and reward learning algorithms
Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)
Experiments in applying interpretability techniques to learned reward functions.
A repo for Implemented online preference-based reward learning under human irrationality & delayed feedback
An interactive system that uses large language models to generate clarification questions for ambiguous human feedback, improving reward learning accuracy.
Version of the PST for DIVA, implemented in E-Prime.
Add a description, image, and links to the reward-learning topic page so that developers can more easily learn about it.
To associate your repository with the reward-learning topic, visit your repo's landing page and select "manage topics."