☆20Nov 13, 2023Updated 2 years ago
Alternatives and similar repositories for rllib
Users that are interested in rllib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Jan 9, 2025Updated last year
- ☆32Nov 13, 2023Updated 2 years ago
- Code for the paper: Causal Action Influence Aware Counterfactual Data Augmentation @ICML2024☆12Jul 19, 2024Updated last year
- ☆66Mar 11, 2024Updated 2 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Paper Implementation of Self-Rewarding Language Models☆13Feb 1, 2024Updated 2 years ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆11May 22, 2020Updated 6 years ago
- My final project submission for the Meta Learning course at BITS Goa (conducted by TCS Research)☆17May 3, 2021Updated 5 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆75Jul 17, 2023Updated 2 years ago
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆47Nov 20, 2025Updated 6 months ago
- Dynamic Movement Primitives in Python☆15Jul 6, 2023Updated 2 years ago
- ☆18Jul 20, 2023Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- Learning Task-parametrized Riemannian Motion Policies from demonstrations.☆16Dec 23, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- model based reinforcement learning algorithms for unstable baselines☆15May 9, 2023Updated 3 years ago
- Thinker project☆16Sep 4, 2024Updated last year
- Perception related packages☆19Dec 18, 2024Updated last year
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆16Jun 9, 2019Updated 6 years ago
- Notes for the Neuroscience & AI Reading Course (SEM-I 2020-21) at BITS Pilani Goa Campus☆14Sep 30, 2020Updated 5 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆98Sep 3, 2020Updated 5 years ago
- ☆23Oct 2, 2025Updated 7 months ago
- Windy GridWorlds environments compatible with OpenAI gym.☆15Jul 8, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.☆17Feb 17, 2021Updated 5 years ago
- ☆38May 18, 2021Updated 5 years ago
- ☆17Oct 31, 2023Updated 2 years ago
- ☆30Aug 25, 2022Updated 3 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- Dataset generation for NeuralGrasps https://arxiv.org/abs/2207.02959☆24Sep 26, 2024Updated last year
- Code associated with "Anxiety, avoidance, and sequential evaluation"☆17Oct 26, 2021Updated 4 years ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- DiBS: Differentiable Bayesian Structure Learning, NeurIPS 2021☆53Feb 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Robotic grasp generation C/C++ library for parallel-jaw grippers☆22Jan 19, 2023Updated 3 years ago
- Code for training on Imagenet to SOTA results using PyTorch☆13Aug 14, 2023Updated 2 years ago
- ☆55Jul 16, 2021Updated 4 years ago
- implementation of our self-guided and self-regularized actor-critic algorithm☆29Jan 1, 2023Updated 3 years ago
- Repository for example Hierarchical Drift Diffusion Model (HDDM) code using JAGS in Python. These scripts provide useful examples for usi…☆30Mar 13, 2024Updated 2 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- Dataset for Image-Goal Navigation in Habitat☆12Feb 24, 2022Updated 4 years ago