belerico / lightning-rlLinks

☆16

Alternatives and similar repositories for lightning-rl

Users that are interested in lightning-rl are comparing it to the libraries listed below

Sorting:

AnswerDotAI / nbdev-template
☆95Updated 2 weeks ago
skypilot-org / skypilot-tutorial
Tutorial to get started with SkyPilot!
☆58Updated last year
huggingface / simulate
🎢 Creating and sharing simulation environments for embodied and synthetic data research
☆191Updated last year
Modalities / modalities
Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.
☆83Updated this week
captain-pool / hydra-wandb-sweeper
WandB sweeps integration with Hydra sweeper
☆49Updated last year
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆95Updated 2 years ago
captain-pool / hydra-example
☆21Updated 3 years ago
raunakdoesdev / claudescholar
☆43Updated last year
awesome-mlops / awesome-ml-experiment-management
A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management 🚀
☆139Updated 11 months ago
augustwester / transformer-xl
A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
☆37Updated 2 years ago
google-research / optformer
☆221Updated last week
ckkissane / rlhf-shakespeare
Shakespeare transformer fine-tuned to generate positive sentiment samples using RLHF
☆9Updated 2 years ago
zeno-ml / zeno
AI Data Management & Evaluation Platform
☆215Updated last year
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆214Updated 7 months ago
Lightning-Universe / DiffusionWithAutoscaler
DiffusionWithAutoscaler
☆29Updated last year
abacaj / train-with-fsdp
☆92Updated last year
kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊
☆125Updated last month
microsoft / deep-language-networks
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…
☆94Updated 11 months ago
allenai / discoverybench
Discovering Data-driven Hypotheses in the Wild
☆99Updated last month
NousResearch / StripedHyenaTrainer
☆61Updated last year
gretelai / awesome-synthetic-data
📖 A curated list of resources dedicated to synthetic data
☆130Updated 2 years ago
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆81Updated 3 years ago
lvwerra / rl-implementations
This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆15Updated 2 years ago
huggingface / huggingface_sb3
Additional code for Stable-baselines3 to load and upload models from the Hub.
☆87Updated last year
LumenPallidium / jepa
Experiments in Joint Embedding Predictive Architectures (JEPAs).
☆41Updated last year
btnorman / First-Explore
Repo to reproduce the First-Explore paper results
☆37Updated 6 months ago
BricksRL / bricksrl
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
☆60Updated 9 months ago
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆309Updated 2 years ago
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆208Updated last month
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 9 months ago