tinkoff-ai/sac-rnd

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tinkoff-ai/sac-rnd)

tinkoff-ai / sac-rnd

Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023

☆58

Alternatives and similar repositories for sac-rnd

Users that are interested in sac-rnd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tinkoff-ai / lb-sac
View on GitHub
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21Feb 27, 2023Updated 3 years ago
tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63Aug 3, 2023Updated 2 years ago
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
corl-team / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43Aug 22, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
Howuhh / sac-n-jax
View on GitHub
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆57May 21, 2023Updated 3 years ago
dunnolab / xland-minigrid-datasets
View on GitHub
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025
☆84Feb 13, 2025Updated last year
tinkoff-ai / eop
View on GitHub
Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022
☆28Jul 10, 2022Updated 4 years ago
microsoft / lightATAC
View on GitHub
A lightweight reimplementation of Adversarially Trained Actor Critic
☆19Mar 19, 2026Updated 4 months ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
dunnolab / vintix
View on GitHub
Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025
☆51May 23, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dunnolab / phi-module
View on GitHub
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆18Jun 12, 2025Updated last year
holarissun / RewardShifting
View on GitHub
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
tinkoff-ai / open-tlab
View on GitHub
Примеры пропозалов для подачи заявки в Open.TLab
☆27Dec 15, 2022Updated 3 years ago
radarFudan / mamba-minimal-jax
View on GitHub
☆36Nov 22, 2024Updated last year
tinkoff-ai / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆79Jun 23, 2023Updated 3 years ago
dunnolab / vintix-II
View on GitHub
Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner - - = ICLR 2026
☆16Apr 8, 2026Updated 3 months ago
conglu1997 / SynthER
View on GitHub
Synthetic Experience Replay
☆114Apr 16, 2026Updated 3 months ago
tinkoff-ai / probabilistic-embeddings
View on GitHub
"Probabilistic Embeddings Revisited" paper official repository
☆31Dec 30, 2022Updated 3 years ago
nmonette / NCC-UED
View on GitHub
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆17Nov 24, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
subho406 / Recurrent-PPO-Jax
View on GitHub
Implementation of Proximal Policy Optimization in Jax+Flax
☆21May 18, 2023Updated 3 years ago
brownirl / lambda_discrepancy
View on GitHub
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆24Oct 28, 2024Updated last year
zdhNarsil / Stochastic-Marginal-Actor-Critic
View on GitHub
Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".
☆24Feb 9, 2023Updated 3 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
thuml / SPOT
View on GitHub
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22Jun 24, 2023Updated 3 years ago
max7born / decision-lstm
View on GitHub
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28Mar 24, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
SpirinEgor / gulag
View on GitHub
GULAG: GUessing LAnGuages with neural networks
☆13May 4, 2022Updated 4 years ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
kenjyoung / dreamerv2_JAX
View on GitHub
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18Jan 16, 2023Updated 3 years ago
tinkoff-ai / palbert
View on GitHub
Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
☆37Apr 8, 2023Updated 3 years ago
zombie-einstein / jaxpr-viz
View on GitHub
Jaxpr Visualisation Tool
☆37Dec 22, 2024Updated last year
DT6A / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆19Oct 22, 2023Updated 2 years ago
Div-Infinity / XQL
View on GitHub
Extreme Q-Learning: Max Entropy RL without Entropy
☆88Feb 14, 2023Updated 3 years ago