sukhijab/maxinforl_torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sukhijab/maxinforl_torch)

sukhijab / maxinforl_torch

☆51

Alternatives and similar repositories for maxinforl_torch

Users that are interested in maxinforl_torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sukhijab / maxinforl_jax
View on GitHub
☆29Jan 8, 2026Updated 6 months ago
jzndd / CP3ER
View on GitHub
The official PyTorch implementation of the paper "Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regul…
☆15Nov 10, 2024Updated last year
Dribble-HRL / Dribble_HRL
View on GitHub
☆12Apr 9, 2026Updated 3 months ago
amazon-far / residual-offpolicy-rl
View on GitHub
☆137Dec 2, 2025Updated 7 months ago
yunglau / QGFN
View on GitHub
QGFN: Controllable Greediness with Action Values - Code
☆11May 17, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
cvoelcker / reppo
View on GitHub
Official Code for "Relative Entropy Pathwise Policy Optimization"
☆57May 6, 2026Updated 2 months ago
naumix / BiggerRegularizedOptimistic
View on GitHub
Official implementation of the BRO algorithm
☆61Jan 29, 2025Updated last year
Viraj-Joshi / MTBench
View on GitHub
☆44Jul 1, 2026Updated last week
seohongpark / horizon-reduction
View on GitHub
The official implementation of "Horizon Reduction Makes RL Scalable"
☆198Aug 2, 2025Updated 11 months ago
RLG-Leiden / edugym
View on GitHub
☆15Sep 22, 2023Updated 2 years ago
DAVIAN-Robotics / SimbaV2
View on GitHub
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆106Nov 4, 2025Updated 8 months ago
vhartman / multirobot-pathplanning-benchmark
View on GitHub
☆37Jun 24, 2026Updated 2 weeks ago
gauthamvasan / avg
View on GitHub
Action Value Gradient Algorithm
☆28May 18, 2025Updated last year
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
imgeorgiev / PWM
View on GitHub
PWM: Policy Learning with Large World Models
☆70Aug 4, 2025Updated 11 months ago
facebookresearch / MRQ
View on GitHub
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆152Apr 7, 2026Updated 3 months ago
Improbable-AI / pql
View on GitHub
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
☆77Aug 2, 2023Updated 2 years ago
gchada / ROAM
View on GitHub
☆18Oct 1, 2025Updated 9 months ago
hsjang0 / LED-GFN
View on GitHub
Learning energy decompositions for partial inference in GFlowNets
☆16Jun 4, 2024Updated 2 years ago
facebookresearch / modemv2
View on GitHub
MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…
☆25Apr 1, 2024Updated 2 years ago
carlosferrazza / BodyTransformer
View on GitHub
Body Transformer: Leveraging Robot Embodiment for Policy Learning
☆197Sep 18, 2025Updated 9 months ago
SonyResearch / simba
View on GitHub
☆128Feb 25, 2025Updated last year
RLE-Foundation / Plasticine
View on GitHub
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆43Feb 9, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
qfettes / CuriosityDrivenExplorationBySelfSupervisedPrediction
View on GitHub
Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch
☆13Jun 10, 2019Updated 7 years ago
sjchoi86 / simple-mujoco-usage-v2
View on GitHub
☆13Sep 12, 2022Updated 3 years ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆319Jul 21, 2025Updated 11 months ago
kngwyu / mujoco-maze
View on GitHub
Simple maze environments using mujoco-py
☆61Dec 27, 2023Updated 2 years ago
KarlXing / RL-Visual-Continuous-Control
View on GitHub
RL Algorithms for Visual Continuous Control
☆36May 31, 2023Updated 3 years ago
ankile / robust-rearrangement
View on GitHub
From Imitation to Refinement -- Residual RL for Precise Assembly
☆244Dec 2, 2025Updated 7 months ago
elicassion / active-gym
View on GitHub
Environments for Active Vision Reinforcement Learning
☆30Oct 10, 2024Updated last year
tongzhoumu / DrS
View on GitHub
Code for "DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks"
☆22Apr 26, 2024Updated 2 years ago
Jaewoopudding / GTA
View on GitHub
Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.
☆32Nov 12, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
learnsyslab / lsy_drone_racing
View on GitHub
LSY Autonomous Drone Racing Challenge
☆79Updated this week
UT-Austin-RobIn / BUMBLE
View on GitHub
☆69Feb 23, 2025Updated last year
peiqi-liu / stretch_ai
View on GitHub
☆14May 21, 2025Updated last year
ColinQiyangLi / qam
View on GitHub
Q-learning with Adjoint Matching
☆99May 11, 2026Updated last month
feel-the-force-ftf / feel-the-force
View on GitHub
☆31Jun 8, 2025Updated last year
R-McHenry / ParallelizedGoExplore
View on GitHub
A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post
☆46Jan 25, 2019Updated 7 years ago
lasr-lab / learning-to-play-piano
View on GitHub
☆26Apr 1, 2026Updated 3 months ago