☆20Nov 13, 2023Updated 2 years ago
Alternatives and similar repositories for rllib
Users that are interested in rllib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Jan 9, 2025Updated last year
- ☆33Nov 13, 2023Updated 2 years ago
- Code for our paper: Online Variational Filtering and Parameter Learning☆20Dec 8, 2021Updated 4 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- A Python and Jsonnet framework for handling espanso configurations☆11Oct 6, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Paper Implementation of Self-Rewarding Language Models☆13Feb 1, 2024Updated 2 years ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆10May 22, 2020Updated 5 years ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 10 months ago
- Offline RL algoritms implemented in Stable Baselines3 (pytorch)☆10Dec 7, 2021Updated 4 years ago
- My final project submission for the Meta Learning course at BITS Goa (conducted by TCS Research)☆17May 3, 2021Updated 4 years ago
- ☆12Aug 26, 2025Updated 7 months ago
- Dynamic Movement Primitives in Python☆15Jul 6, 2023Updated 2 years ago
- Structure refinement software for total scattering data☆13Mar 18, 2026Updated last week
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Feb 9, 2021Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18Jul 20, 2023Updated 2 years ago
- ☆16Jul 4, 2019Updated 6 years ago
- ☆12Oct 13, 2017Updated 8 years ago
- The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning☆40Aug 13, 2024Updated last year
- Python Package for EIT(Electric Impedance Tomography)-like problems using Gauss-Newton method.☆16Nov 5, 2025Updated 4 months ago
- Learning Task-parametrized Riemannian Motion Policies from demonstrations.☆16Dec 23, 2022Updated 3 years ago
- ☆21Apr 12, 2024Updated last year
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- 2022华为软件精英挑战赛 - 杭厦赛区 - 土豪法称霸杭厦 - 决赛季军☆14Jul 31, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Thinker project☆16Sep 4, 2024Updated last year
- Official code for UnICORNN (ICML 2021)☆28Oct 1, 2021Updated 4 years ago
- A Simulated Optimal Intrusion Response Game☆21Apr 3, 2022Updated 3 years ago
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆16Jun 9, 2019Updated 6 years ago
- Notes for the Neuroscience & AI Reading Course (SEM-I 2020-21) at BITS Pilani Goa Campus☆14Sep 30, 2020Updated 5 years ago
- Building blocks for productive research☆70Updated this week
- Multi-Agent Reinforcement Learning on network-security☆20Apr 12, 2022Updated 3 years ago
- Belief-state planning for POMDPs using learned approximations☆23Jan 21, 2025Updated last year
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 9 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Windy GridWorlds environments compatible with OpenAI gym.☆15Jul 8, 2022Updated 3 years ago
- Fast reinforcement learning research☆61Dec 7, 2024Updated last year
- ☆17Oct 31, 2023Updated 2 years ago
- ☆30Aug 25, 2022Updated 3 years ago
- DNN Node Collection using Inference Helper in ROS2☆13Apr 24, 2022Updated 3 years ago
- Dataset generation for NeuralGrasps https://arxiv.org/abs/2207.02959☆24Sep 26, 2024Updated last year
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆22Sep 25, 2022Updated 3 years ago