tinkoff-ai/lb-sac

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tinkoff-ai/lb-sac)

tinkoff-ai / lb-sac

Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop

☆21

Alternatives and similar repositories for lb-sac

Users that are interested in lb-sac are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tinkoff-ai / eop
View on GitHub
Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022
☆28Jul 10, 2022Updated 4 years ago
corl-team / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43Aug 22, 2023Updated 2 years ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
tinkoff-ai / probabilistic-embeddings
View on GitHub
"Probabilistic Embeddings Revisited" paper official repository
☆31Dec 30, 2022Updated 3 years ago
Howuhh / sac-n-jax
View on GitHub
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆57May 21, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tinkoff-ai / palbert
View on GitHub
Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
☆37Apr 8, 2023Updated 3 years ago
tinkoff-ai / open-tlab
View on GitHub
Примеры пропозалов для подачи заявки в Open.TLab
☆27Dec 15, 2022Updated 3 years ago
tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63Aug 3, 2023Updated 2 years ago
tinkoff-ai / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆79Jun 23, 2023Updated 3 years ago
corl-team / ad-eps
View on GitHub
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35Sep 18, 2024Updated last year
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
dunnolab / vintix
View on GitHub
Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025
☆51May 23, 2025Updated last year
dunnolab / phi-module
View on GitHub
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆18Jun 12, 2025Updated last year
schatty / awesome-memory-rl
View on GitHub
A curated list of awesome memory in reinforcement learning research materials
☆24Sep 5, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
DT6A / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆19Oct 22, 2023Updated 2 years ago
ShivankUdayawal / Regression-on-Car-Insurance-Dataset
View on GitHub
Predicting for Customers, whether they will buy car insurance or not.
☆11Jan 29, 2021Updated 5 years ago
dunnolab / xland-minigrid-datasets
View on GitHub
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025
☆84Feb 13, 2025Updated last year
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
radarFudan / mamba-minimal-jax
View on GitHub
☆36Nov 22, 2024Updated last year
brownirl / lambda_discrepancy
View on GitHub
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆24Oct 28, 2024Updated last year
htdt / lwm
View on GitHub
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23Apr 28, 2021Updated 5 years ago
Howuhh / faster-trajectory-transformer
View on GitHub
Implementation of Trajectory Transformer with attention caching and batched beam search
☆118Apr 27, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
SpirinEgor / gulag
View on GitHub
GULAG: GUessing LAnGuages with neural networks
☆13May 4, 2022Updated 4 years ago
psclklnk / currot
View on GitHub
Source Code for the ICML Paper "Curriculum Reinforcement Learning via Constrained Optimal Transport"
☆16Jun 9, 2022Updated 4 years ago
webstorms / Blocks
View on GitHub
A new model for quickly training and simulating adaptive leaky integrate-and-fire spiking neural networks.
☆14Apr 9, 2024Updated 2 years ago
eric-mitchell / macaw-min
View on GitHub
Clean, extensible implementation of MACAW [ICML 2021]
☆12Dec 7, 2021Updated 4 years ago
zombie-einstein / jaxpr-viz
View on GitHub
Jaxpr Visualisation Tool
☆37Dec 22, 2024Updated last year
alxmamaev / ultimate_tts
View on GitHub
☆13Aug 7, 2021Updated 4 years ago
dunnolab / harmony
View on GitHub
[ICML 2026 GenBio Workshop] Official Implementation for "Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking"
☆15Jun 30, 2026Updated 3 weeks ago
seongun-kim / vcrl
View on GitHub
[ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
☆12Jul 15, 2023Updated 3 years ago
corl-team / lime
View on GitHub
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
☆32May 28, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tinkoff-ai / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,369Aug 3, 2023Updated 2 years ago
fgvbrt / retro_contest
View on GitHub
☆15Mar 31, 2023Updated 3 years ago
corl-team / flexsae
View on GitHub
Official Triton kernels for TopK and HierarchicalTopK Sparse Autoencoder decoders.
☆29Sep 29, 2025Updated 9 months ago
corl-team / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆652Feb 10, 2024Updated 2 years ago
JinnnK / TGRF
View on GitHub
Public sourcecode for Transformable Gaussian Reward Function for Robot Navigation with Deep Reinforcement Learning
☆22Aug 7, 2024Updated last year
dunnolab / awesome-in-context-rl
View on GitHub
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆305Sep 8, 2025Updated 10 months ago
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago