tinkoff-ai/eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022

☆28

Alternatives and similar repositories for eop

Users who are interested in eop are comparing it to the libraries listed below.

tinkoff-ai / lb-sac
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21 · Feb 27, 2023 · Updated 3 years ago
tinkoff-ai / cnf
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12 · Jan 31, 2023 · Updated 3 years ago
tinkoff-ai / probabilistic-embeddings
"Probabilistic Embeddings Revisited" paper official repository
☆31 · Dec 30, 2022 · Updated 3 years ago
tinkoff-ai / katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆79 · Jun 23, 2023 · Updated 3 years ago
tinkoff-ai / palbert
Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
☆37 · Apr 8, 2023 · Updated 3 years ago
corl-team / katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43 · Aug 22, 2023 · Updated 2 years ago
Howuhh / sac-n-jax
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆56 · May 21, 2023 · Updated 3 years ago
dunnolab / NinA
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17 · Sep 22, 2025 · Updated 10 months ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
dunnolab / phi-module
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆18 · Jun 12, 2025 · Updated last year
corl-team / counting_manifolds
Code for the reproduction of counting manifolds
☆16 · Feb 26, 2026 · Updated 4 months ago
thethaibinh / agile_flight
Simulation system for path planning evaluation
☆13 · Dec 13, 2025 · Updated 7 months ago
catalyst-team / hydra-slayer
☆16 · Jan 4, 2024 · Updated 2 years ago
tinkoff-ai / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,368 · Aug 3, 2023 · Updated 2 years ago
corl-team / ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35 · Sep 18, 2024 · Updated last year
htdt / lwm
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23 · Apr 28, 2021 · Updated 5 years ago
corl-team / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆652 · Feb 10, 2024 · Updated 2 years ago
imustafin / brie_doom
DOOM source port in Eiffel with SDL2
☆11 · Sep 9, 2025 · Updated 10 months ago
glassroom / heinsen_sequence
Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)
☆98 · Dec 5, 2024 · Updated last year
hr0nix / dejax
Accelerated replay buffers in JAX
☆46 · Sep 17, 2022 · Updated 3 years ago
col-in-coding / robot-modeling
Quadruped Robot controller design and simulation on Webots
☆12 · Apr 28, 2020 · Updated 6 years ago
webstorms / Blocks
A new model for quickly training and simulating adaptive leaky integrate-and-fire spiking neural networks.
☆14 · Apr 9, 2024 · Updated 2 years ago
jetnew / visrl
A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment.
☆14 · Jan 8, 2022 · Updated 4 years ago
alxmamaev / ultimate_tts
☆13 · Aug 7, 2021 · Updated 4 years ago
dunnolab / harmony
[ICML 2026 GenBio Workshop] Official Implementation for "Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking"
☆15 · Jun 30, 2026 · Updated 3 weeks ago
jsw7460 / sb3_jax
☆13 · Aug 9, 2022 · Updated 3 years ago
corl-team / lime
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
☆32 · May 28, 2025 · Updated last year
Miffyli / gan-aimbots
Code for the experiments done in the paper "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters"
☆24 · May 13, 2022 · Updated 4 years ago
schatty / awesome-memory-rl
A curated list of awesome memory in reinforcement learning research materials
☆24 · Sep 5, 2021 · Updated 4 years ago
google-research / dataclass_array
Dataclasses manipulated as numpy arrays (with batching, reshape, slicing,...)
☆54 · Jul 9, 2026 · Updated 2 weeks ago
yudasong / HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
☆24 · Feb 16, 2023 · Updated 3 years ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32 · Oct 12, 2023 · Updated 2 years ago
joeegan17 / DQN-for-Electrical-Microgrid-Control
Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid
☆11 · Jan 3, 2023 · Updated 3 years ago
dunnolab / xland-minigrid-datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning, ICLR 2025
☆84 · Feb 13, 2025 · Updated last year
vkurenkov / haxball-chameleon
Solving Haxball (www.haxball.com) using Imitation Learning methods.
☆23 · Nov 19, 2019 · Updated 6 years ago
Charlie0257 / T2TL
Exploiting Transformer in Reinforcement Learning for Interpretable Temporal Logic Motion Planning (RAL 2023)
☆12 · Jul 17, 2023 · Updated 3 years ago
DT6A / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆19 · Oct 22, 2023 · Updated 2 years ago
TanguyLevent / RL4Microgrids
RL for Energy Management of Microgrids
☆11 · Mar 28, 2020 · Updated 6 years ago
philipjball / OffCon3
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25 · Jun 20, 2021 · Updated 5 years ago