Aladoro/Stabilizing-Off-Policy-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Aladoro/Stabilizing-Off-Policy-RL)

Aladoro / Stabilizing-Off-Policy-RL

☆18

Alternatives and similar repositories for Stabilizing-Off-Policy-RL

Users that are interested in Stabilizing-Off-Policy-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UtkarshMishra04 / pixel-representations-RL
View on GitHub
This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…
☆14Feb 27, 2023Updated 3 years ago
iwiwi / epochraft-hf-fsdp
View on GitHub
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11Jan 29, 2024Updated 2 years ago
FrankZheng2022 / TACO
View on GitHub
Code for "TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning"
☆28May 19, 2024Updated 2 years ago
Aladoro / domain-robust-visual-il
View on GitHub
Domain-Robust Visual Imitation Learning with Mutual Information Constraints code
☆19Mar 1, 2021Updated 5 years ago
tristandeleu / jax-meta-learning
View on GitHub
A collection of meta-learning algorithms in Jax
☆24Sep 3, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
e2crawfo / silot
View on GitHub
Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).
☆13Mar 24, 2023Updated 3 years ago
happywu / Self-Sup-Attention-RL
View on GitHub
Self-Supervised Attention-Aware Reinforcement Learning
☆18May 20, 2022Updated 4 years ago
openpsi-project / srl
View on GitHub
A Really Scalable RL Framework to 10k+ CPUs
☆38Feb 29, 2024Updated 2 years ago
younggyoseo / rnn-auxiliary-loss
View on GitHub
Learning Longer-term Dependencies in RNNs with Auxiliary Losses - Implementation in PyTorch.
☆17Aug 26, 2018Updated 7 years ago
denisyarats / dmc2gym
View on GitHub
OpenAI Gym wrapper for the DeepMind Control Suite
☆229May 19, 2024Updated 2 years ago
facebookresearch / drqv2
View on GitHub
DrQ-v2: Improved Data-Augmented Reinforcement Learning
☆437May 31, 2022Updated 4 years ago
kid-yang233 / robots
View on GitHub
The homework of robos learning base.
☆11May 23, 2023Updated 3 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
qiangwang-academic / UR5E_robot_gym_env_Real_and_Sim
View on GitHub
Reinforcement learning environment for UR5e robot with OPENAI gym like format. Include both simulation and real parts.
☆15Nov 2, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JindongJiang / GNM
View on GitHub
Official Release of NeurIPS 2020 Spotlight paper "Generative Neurosymbolic Machines"
☆37Mar 9, 2024Updated 2 years ago
joenghl / HYPO
View on GitHub
☆14Dec 29, 2023Updated 2 years ago
ikostrikov / jaxrl2
View on GitHub
☆58Jan 20, 2023Updated 3 years ago
microsoft / ATAC
View on GitHub
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74Feb 2, 2023Updated 3 years ago
deep-diver / LLM-Serve
View on GitHub
This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.
☆18Apr 20, 2023Updated 3 years ago
lz1oceani / pointcloud_rl
View on GitHub
☆40Jun 17, 2023Updated 3 years ago
skku-taehwan / KoreanRecipeGPT
View on GitHub
ratsnlp, KOGPT2와 recipegpt github를 참고하여 음식명과 식재료명을 입력하면 레시피를 생성해주는 모델을 제작하였습니다!!
☆11Dec 28, 2021Updated 4 years ago
zhaohengyin / EfficientImitate
View on GitHub
Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''
☆41Oct 25, 2022Updated 3 years ago
denisyarats / drq
View on GitHub
DrQ: Data regularized Q
☆422Jan 13, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ahmed-touati / controllable_agent
View on GitHub
☆61Jun 6, 2023Updated 3 years ago
AlvinWen428 / keyframe-focused-imitation-learning
View on GitHub
☆11Dec 13, 2021Updated 4 years ago
cqian19 / qmix-plus
View on GitHub
Improving upon state of the art cooperative deep reinforcement learning in StarCraft II
☆13May 16, 2019Updated 7 years ago
google-deepmind / csuite
View on GitHub
☆47Sep 24, 2024Updated last year
dmsm / MarioNette
View on GitHub
Code for MarioNette: Self-Supervised Sprite Learning, in NeurIPS 2021
☆40Oct 20, 2021Updated 4 years ago
alec-tschantz / planet
View on GitHub
PlaNet: Learning Latent Dynamics for Planning from Pixels
☆10Feb 13, 2020Updated 6 years ago
GuanSuns / ASGRL
View on GitHub
Official python implementation of ASGRL in ICML 2022 paper: Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill D…
☆20Oct 5, 2022Updated 3 years ago
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
WendyShang / flare
View on GitHub
Reinforcement Learning with Latent Flow
☆43Mar 25, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dennisl88 / rand_param_envs
View on GitHub
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
☆20Feb 14, 2019Updated 7 years ago
LLaVA-VL / llava-vl.github.io
View on GitHub
☆13Mar 9, 2024Updated 2 years ago
IDSIA / recurrent-fwp
View on GitHub
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆52Jun 11, 2025Updated last year
JindongJiang / SCALOR
View on GitHub
Official Release of ICLR 2020 paper "SCALOR: Generative World Models with Scalable Object Representations"
☆49Dec 24, 2023Updated 2 years ago
timleslie / rust_vs_cython
View on GitHub
An experiment to compare the performance of Rust and Cython
☆16Aug 7, 2021Updated 4 years ago
joeybose / comp760_lecturenotes
View on GitHub
COMP760 Lecture Notes
☆34Jan 13, 2023Updated 3 years ago
SimondeMoreau / LED
View on GitHub
LED : Light Enhanced Depth Estimation at Night
☆15Mar 24, 2026Updated 4 months ago