belerico / lightning-rl
☆16 Updated 3 years ago
Alternatives and similar repositories for lightning-rl
Users who are interested in lightning-rl are comparing it to the libraries listed below.
- ☆101 Updated last month
- Automatic gradient descent ☆216 Updated 2 years ago
- WandB sweeps integration with Hydra sweeper ☆50 Updated last year
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research ☆192 Updated 2 years ago
- DiffusionWithAutoscaler ☆29 Updated last year
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- Functional local implementations of main model parallelism approaches ☆95 Updated 2 years ago
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training. ☆91 Updated last week
- SMIT: A Simple Modality Integration Tool ☆15 Updated last year
- AI Data Management & Evaluation Platform ☆215 Updated 2 years ago
- git extension for {collaborative, communal, continual} model development ☆217 Updated last year
- ☆236 Updated last month
- ☆20 Updated 3 years ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆136 Updated 2 months ago
- Clean RL implementation using MLX ☆34 Updated last year
- ☆44 Updated last year
- Interpreting how transformers simulate agents performing RL tasks ☆90 Updated 2 years ago
- Unity Machine Learning Agents Toolkit ☆48 Updated 2 years ago
- ☆62 Updated 2 years ago
- Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL) ☆82 Updated 2 years ago
- API Client for paperswithcode.com ☆188 Updated last year
- some common Huggingface transformers in maximal update parametrization (µP) ☆87 Updated 3 years ago
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models" ☆48 Updated 2 years ago
- ML/DL Math and Method notes ☆65 Updated 2 years ago
- Generative cellular automaton-like learning environments for RL. ☆20 Updated 11 months ago
- Additional code for Stable-baselines3 to load and upload models from the Hub. ☆90 Updated last year
- A langchain agent that retries ☆51 Updated 2 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference… ☆216 Updated 7 months ago
- Automatic Prompt Optimization ☆48 Updated last year
- Tools to make language models a bit easier to use ☆63 Updated this week