belerico / lightning-rl
☆16 Updated 2 years ago
Alternatives and similar repositories for lightning-rl
Users that are interested in lightning-rl are comparing it to the libraries listed below
- ☆95 Updated 2 weeks ago
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research ☆191 Updated last year
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training. ☆83 Updated this week
- WandB sweeps integration with Hydra sweeper ☆49 Updated last year
- Functional local implementations of main model parallelism approaches ☆95 Updated 2 years ago
- ☆21 Updated 3 years ago
- ☆43 Updated last year
- A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management 🚀 ☆139 Updated 11 months ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019) ☆37 Updated 2 years ago
- ☆221 Updated last week
- Shakespeare transformer fine-tuned to generate positive sentiment samples using RLHF ☆9 Updated 2 years ago
- AI Data Management & Evaluation Platform ☆215 Updated last year
- git extension for {collaborative, communal, continual} model development ☆214 Updated 7 months ago
- DiffusionWithAutoscaler ☆29 Updated last year
- ☆92 Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆125 Updated last month
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts… ☆94 Updated 11 months ago
- Discovering Data-driven Hypotheses in the Wild ☆99 Updated last month
- ☆61 Updated last year
- 📖 A curated list of resources dedicated to synthetic data ☆130 Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP) ☆81 Updated 3 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms. ☆15 Updated 2 years ago
- Additional code for Stable-baselines3 to load and upload models from the Hub. ☆87 Updated last year
- Experiments in Joint Embedding Predictive Architectures (JEPAs). ☆41 Updated last year
- Repo to reproduce the First-Explore paper results ☆37 Updated 6 months ago
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO ☆60 Updated 9 months ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆309 Updated 2 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference… ☆208 Updated last month
- Supercharge huggingface transformers with model parallelism. ☆77 Updated 9 months ago