gauthamvasan/avg

Action Value Gradient Algorithm

☆29

Alternatives and similar repositories for avg

Users who are interested in avg are comparing it to the libraries listed below.

machado-research / AgarCL
Agar.io for Continual Reinforcement Learning
☆24 · Jul 24, 2025 · Updated last year
esraaelelimy / rtus
Real-Time RTUs
☆12 · Mar 20, 2026 · Updated 4 months ago
qlan3 / Jaxplorer
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
☆13 · Jul 19, 2024 · Updated 2 years ago
mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆293 · Mar 18, 2025 · Updated last year
rlai-lab / ReLoD
An efficient remote-onboard architecture for real-time Reinforcement Learning
☆17 · Jun 28, 2024 · Updated 2 years ago
pd-perry / TQL
☆28 · May 11, 2026 · Updated 2 months ago
SAIC-MONTREAL / hyperzero
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆24 · Apr 26, 2023 · Updated 3 years ago
RajGhugare19 / stitching-is-combinatorial-generalisation
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆25 · Apr 19, 2024 · Updated 2 years ago
AlexGoldie / learn-rl-algorithms
Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"
☆23 · Sep 7, 2025 · Updated 10 months ago
CMU-AIRe / floq
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆46 · Apr 7, 2026 · Updated 3 months ago
aliang8 / varibad_jax
☆10 · Jun 27, 2024 · Updated 2 years ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242 · Nov 24, 2025 · Updated 8 months ago
SonyResearch / simba
☆128 · Feb 25, 2025 · Updated last year
mazpie / redundancy-action-spaces
[RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation
☆23 · Jun 6, 2024 · Updated 2 years ago
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)
☆84 · Feb 13, 2025 · Updated last year
notmahi / disk
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21 · Mar 22, 2022 · Updated 4 years ago
lilucse / SparseNetwork4DRL
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆41 · Jun 5, 2025 · Updated last year
MouseHu / GEM
☆16 · Jul 1, 2021 · Updated 5 years ago
TTomilin / COOM
COOM: Benchmarking Continual Reinforcement Learning on Doom
☆27 · Mar 5, 2026 · Updated 4 months ago
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated 2 years ago
Probabilistic-and-Interactive-ML / awesome-plasticity-loss
Collection of resources on plasticity loss in deep reinforcement learning
☆23 · Nov 12, 2024 · Updated last year
hari-sikchi / DVL
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16 · Oct 22, 2023 · Updated 2 years ago
google-deepmind / lm_act
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
☆30 · May 21, 2025 · Updated last year
radarFudan / mamba-minimal-jax
☆36 · Nov 22, 2024 · Updated last year
RyanNavillus / reward-surfaces
☆19 · Apr 22, 2024 · Updated 2 years ago
yun-kwak / decision-transformer-jax
Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku
☆13 · Aug 14, 2024 · Updated last year
kylestach / bigvision-palivla
☆15 · Sep 4, 2025 · Updated 10 months ago
alexanderswerdlow / faster
☆30 · Jun 30, 2026 · Updated 3 weeks ago
adityab / CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆95 · Jun 4, 2024 · Updated 2 years ago
dunnolab / laom
Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025
☆39 · Jul 8, 2025 · Updated last year
ikostrikov / dmcgym
☆23 · Aug 19, 2022 · Updated 3 years ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115 · Apr 16, 2026 · Updated 3 months ago
yihaosun1124 / mobile
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆22 · Apr 17, 2024 · Updated 2 years ago
bryanoliveira / sliding-puzzles-gym
A scalable benchmark for state representation learning in visual reinforcement learning.
☆17 · Jun 23, 2025 · Updated last year
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆30 · Dec 14, 2021 · Updated 4 years ago
dunnolab / vintix
Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)
☆51 · May 23, 2025 · Updated last year
instadeepai / flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆279 · Sep 22, 2025 · Updated 10 months ago
Viraj-Joshi / MTBench
☆45 · Jul 1, 2026 · Updated 3 weeks ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆62 · Aug 4, 2022 · Updated 3 years ago