prajjwal1 / rl_paradigmView external linksLinks
Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"
☆17Jan 31, 2024Updated 2 years ago
Alternatives and similar repositories for rl_paradigm
Users that are interested in rl_paradigm are comparing it to the libraries listed below
Sorting:
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Actor Prioritized Experience Replay☆18Nov 20, 2023Updated 2 years ago
- Implementations of Stable Contrastive RL☆23Apr 13, 2025Updated 10 months ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆23Feb 15, 2023Updated 3 years ago
- ☆19Jun 25, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆22Mar 11, 2022Updated 3 years ago
- ☆20Oct 19, 2022Updated 3 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆27Feb 24, 2023Updated 2 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 3 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆36Oct 19, 2023Updated 2 years ago
- ☆16Feb 1, 2026Updated 2 weeks ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆10Aug 7, 2023Updated 2 years ago
- Accepted at WWW 25 Industrial Track (oral)☆16Jun 6, 2025Updated 8 months ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- Alpha mining with DEAP-based genetic programming.☆11Jul 7, 2023Updated 2 years ago
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11May 23, 2024Updated last year
- ☆11Nov 13, 2025Updated 3 months ago
- ☆11May 24, 2024Updated last year
- factory.ai FACTORY_API_KEY switch and query☆27Dec 6, 2025Updated 2 months ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- ☆11Sep 5, 2024Updated last year
- This repository is created on top of two repositories i.e., yolov7 face detection and yolov7 blurring object☆15Jan 21, 2023Updated 3 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- Apply Graph Neural Networks to Optimize Factor Feature Extraction of FactorVAE☆12Jan 11, 2025Updated last year
- Sample code for the paper "VLM-driven Behavior Tree for Context-aware Task Planning”☆16Jan 10, 2025Updated last year
- ☆10Sep 7, 2022Updated 3 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆12Jan 17, 2025Updated last year
- ☆10Feb 20, 2024Updated last year
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆12Oct 28, 2024Updated last year
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 4 years ago
- Testing and implementation of ML algorithms for the analysis of cryptocurrency trends.☆11Feb 20, 2024Updated last year
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago