DT6A/ReBRAC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DT6A/ReBRAC)

DT6A / ReBRAC

Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC

☆19

Alternatives and similar repositories for ReBRAC

Users that are interested in ReBRAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63Aug 3, 2023Updated 2 years ago
tinkoff-ai / lb-sac
View on GitHub
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21Feb 27, 2023Updated 3 years ago
Howuhh / sac-n-jax
View on GitHub
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆56May 21, 2023Updated 3 years ago
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
andylolu2 / jax-vqvae-gpt
View on GitHub
Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.
☆11Nov 23, 2023Updated 2 years ago
andrewargatkiny / dense-attention
View on GitHub
This is the repo for DenseAttention and DANet - fast and conceptually simple modification of standard attention and Transformer
☆20Apr 6, 2026Updated 3 months ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
dunnolab / phi-module
View on GitHub
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆18Jun 12, 2025Updated last year
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
nissymori / remax-rl
View on GitHub
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15Jul 3, 2026Updated 3 weeks ago
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 2 years ago
tinkoff-ai / probabilistic-embeddings
View on GitHub
"Probabilistic Embeddings Revisited" paper official repository
☆31Dec 30, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zhjiang22 / SYSU_Course
View on GitHub
Collect information about 2018 CS courses in CSE of SYSU.
☆11Jun 29, 2022Updated 4 years ago
WISEPLAT / QuikPy
View on GitHub
Библиотека-обертка, которая позволяет получить доступ к функционалу Quik из Python
☆12Feb 16, 2024Updated 2 years ago
ztjhz / t5-jax
View on GitHub
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆24Jun 10, 2023Updated 3 years ago
polixir / d3pe
View on GitHub
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
☆10Jun 2, 2022Updated 4 years ago
odhyan / options-calculator
View on GitHub
Calculate expected profit & loss for options
☆15Aug 5, 2019Updated 6 years ago
pgermain / PAC-Bayesian-Theory-Meets-Bayesian-Inference
View on GitHub
Code to related to my NIPS 2016 paper
☆10Dec 4, 2016Updated 9 years ago
tinkoff-ai / eop
View on GitHub
Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022
☆28Jul 10, 2022Updated 4 years ago
MLD3 / OfflineRL_ModelSelection
View on GitHub
[MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003
☆11Oct 6, 2022Updated 3 years ago
magruener / reconstructing-proprietary-video-streaming-algorithms
View on GitHub
This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"
☆14Mar 24, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
anniesch / single-life-rl
View on GitHub
Single-Life Reinforcement Learning
☆14Dec 17, 2022Updated 3 years ago
keraJLi / synthetic-gymnax
View on GitHub
Drop-in environment replacements that make your RL algorithm train faster.
☆22Jun 19, 2024Updated 2 years ago
Baichenjia / UTDS
View on GitHub
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
☆18Nov 21, 2023Updated 2 years ago
amorev / vibecode-setup-public
View on GitHub
☆15Jun 12, 2026Updated last month
zhanghuanhuan1994 / arsenal
View on GitHub
☆12Apr 12, 2022Updated 4 years ago
Ernie1 / SYSU-Exam
View on GitHub
收集整理SYSU期末考试卷子、资料
☆10Jul 9, 2019Updated 7 years ago
SlippyDong / supabase-mcp-cursor
View on GitHub
A Supabase MCP server compatible with cursor
☆20Feb 13, 2025Updated last year
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
schepal / deribit_flows_heatmap
View on GitHub
☆18Nov 23, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
RemexHu / BandwidthPrediction-LSTM
View on GitHub
Real-time Bandwidth Prediction based on LSTM
☆10Mar 19, 2025Updated last year
eric-mitchell / macaw-min
View on GitHub
Clean, extensible implementation of MACAW [ICML 2021]
☆12Dec 7, 2021Updated 4 years ago
tinkoff-ai / palbert
View on GitHub
Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
☆37Apr 8, 2023Updated 3 years ago
jayin92 / pix2pix-terrain-generator
View on GitHub
pix2pix model for generating terrain
☆17Jan 7, 2023Updated 3 years ago
pinghsieh / FSAF
View on GitHub
☆11Oct 25, 2021Updated 4 years ago
lboasso / oberon0
View on GitHub
Adapted source code of Niklaus Wirth's "Compiler Construction" book
☆21May 11, 2023Updated 3 years ago
facebookresearch / tce
View on GitHub
Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.
☆14Sep 15, 2023Updated 2 years ago