Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆19Oct 22, 2023Updated 2 years ago
Alternatives and similar repositories for ReBRAC
Users that are interested in ReBRAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆63Aug 3, 2023Updated 2 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆56May 21, 2023Updated 3 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆58Feb 3, 2023Updated 3 years ago
- ☆18Apr 17, 2026Updated 2 months ago
- Collect information about 2018 CS courses in CSE of SYSU.☆11Jun 29, 2022Updated 4 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆63Apr 29, 2024Updated 2 years ago
- Feasibility Consistent Representation Learning for Safe Reinforcement Learning (ICML 2024). Current SOTA model-free safe RL algorithm on …☆17Jul 12, 2024Updated last year
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆10Jun 2, 2022Updated 4 years ago
- Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.☆66Dec 19, 2025Updated 6 months ago
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Jul 10, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Whiteboard for Driving Diagrams🚗☆86Jun 20, 2026Updated 2 weeks ago
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆11Oct 6, 2022Updated 3 years ago
- This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"☆14Mar 24, 2021Updated 5 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Mar 19, 2026Updated 3 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated 2 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- 收集整理SYSU期末考试卷子、资料☆10Jul 9, 2019Updated 6 years ago
- pix2pix model for generating terrain☆17Jan 7, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Oct 25, 2021Updated 4 years ago
- The deribit_historical_trades repository gathers cryptocurrency (BTC, ETH, SOL, USDC) derivatives traded on the cryptocurrency derivative…☆24Mar 1, 2023Updated 3 years ago
- Real-time Bandwidth Prediction based on LSTM☆10Mar 19, 2025Updated last year
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- Official implementation of "cmSalGAN: RGB-D Salient Object Detection with Cross-View Generative Adversarial Networks" (IEEE TMM 2020)☆11Aug 23, 2021Updated 4 years ago
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 3 years ago
- This is the official repository for "DiffSG: A Generative Solver for Network Optimization with Diffusion Model" and "Diffusion Models as …☆20Feb 10, 2025Updated last year
- Bayes-Adaptive Monte-Carlo Planning algorithm☆19Mar 5, 2013Updated 13 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 订餐系统☆14Mar 5, 2016Updated 10 years ago
- SYDE 671 final project source code, copied from Google Research to avoid cloning the entire Google Research repo.☆14Nov 17, 2019Updated 6 years ago
- ☆23May 22, 2026Updated last month
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆14Sep 15, 2023Updated 2 years ago
- ☆19Jun 25, 2023Updated 3 years ago
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- 技育CAMP ハッカソンvol.5 開発を効率化するアプリケーションを作ろう! 最優秀賞 & 技育展2021 開発/スキル支援 部門 最優秀賞☆28Mar 6, 2023Updated 3 years ago