rraileanu/policy-dynamics-value-functions

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rraileanu/policy-dynamics-value-functions)

rraileanu / policy-dynamics-value-functions

☆33

Alternatives and similar repositories for policy-dynamics-value-functions

Users that are interested in policy-dynamics-value-functions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year
Ji4chenLi / Multi-Task-Batch-RL
View on GitHub
☆26Mar 16, 2023Updated 3 years ago
dennisl88 / rand_param_envs
View on GitHub
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
☆20Feb 14, 2019Updated 7 years ago
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
danielshin1 / oprl
View on GitHub
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆20Dec 30, 2022Updated 3 years ago
kaixin96 / mixreg
View on GitHub
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆34Oct 22, 2020Updated 5 years ago
011235813 / SEPT
View on GitHub
Single Episode Policy Transfer in Reinforcement Learning
☆17Jun 13, 2022Updated 4 years ago
anishmadan23 / MAML_Pytorch_RL
View on GitHub
☆10Aug 8, 2021Updated 4 years ago
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated 2 years ago
0xWelt / VibeRL
View on GitHub
VibeRL is a Reinforcement Learning framework built essentially through vibe coding with Kimi K2.
☆17Updated this week
DrZero0 / MACC
View on GitHub
The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".
☆18May 1, 2022Updated 4 years ago
mit-ll / hanabi_AnyPlay
View on GitHub
☆15Jun 28, 2022Updated 4 years ago
jsikyoon / bmaml_rl
View on GitHub
This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.
☆20Jan 19, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
frt03 / generalized_dt
View on GitHub
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆70Aug 8, 2022Updated 3 years ago
only-changer / GeneraLight
View on GitHub
☆12Aug 15, 2020Updated 5 years ago
polixir / d3pe
View on GitHub
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
☆10Jun 2, 2022Updated 4 years ago
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
microsoft / MAMBA
View on GitHub
Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
eugenevinitsky / robust_RL_multi_adversary
View on GitHub
We investigate the effect of populations on finding good solutions to the robust MDP
☆29Mar 27, 2021Updated 5 years ago
sujoyp / subgoal-discovery
View on GitHub
Learning from Trajectories via Subgoal Discovery
☆12Dec 10, 2020Updated 5 years ago
lamda-bbo / madac
View on GitHub
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26Mar 6, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
typoverflow / .dotfiles
View on GitHub
A repo containing bash scripts to deploy reinforcement learning dev environment within one click!
☆11Jun 28, 2026Updated 3 weeks ago
rythei / DARLA-PyTorch
View on GitHub
PyTorch implementation of DARLA preprocessing models
☆11Jan 30, 2018Updated 8 years ago
uoe-agents / LIAM
View on GitHub
Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"
☆43Oct 5, 2022Updated 3 years ago
singhalrk / stein_ksd
View on GitHub
☆10Apr 2, 2018Updated 8 years ago
qiongwu86 / Task-Offloading-in-Vfc-Assisted-Platoons
View on GitHub
☆12Aug 24, 2023Updated 2 years ago
sheydashz / federated-double-deep-Q_network-
View on GitHub
A framework that exploits the potentials of distributed federated learning and double deep Q-networks to minimize joint energy and delay …
☆11Apr 21, 2021Updated 5 years ago
trackoor / AwesomePL
View on GitHub
A Collection of Papers & Notes in Programming Language & Formal Verification
☆17May 10, 2022Updated 4 years ago
russellmendonca / maesn_suite
View on GitHub
☆44Oct 27, 2018Updated 7 years ago
ml-jku / rudder-demonstration-code
View on GitHub
Code for demonstration example-task in RUDDER blog
☆24May 19, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lengyijun / polonius-proof
View on GitHub
Verify naive = datafrog-opt, in rust/polonius
☆16Jun 26, 2025Updated last year
denisyarats / proto
View on GitHub
Proto-RL: Reinforcement Learning with Prototypical Representations
☆87Jun 12, 2022Updated 4 years ago
facebookresearch / mtrl
View on GitHub
Multi Task RL Baselines
☆269Dec 31, 2021Updated 4 years ago
iclavera / learning_to_adapt
View on GitHub
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
☆218Dec 27, 2022Updated 3 years ago
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
lafmdp / HIDIL
View on GitHub
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12Nov 24, 2021Updated 4 years ago
regehr / pldi22-llvm-tutorial
View on GitHub
outline and links for PLDI 2022 tutorial
☆17Jun 13, 2022Updated 4 years ago