belerico / lightning-rl
☆16 · Updated 2 years ago
Alternatives and similar repositories for lightning-rl:
Users who are interested in lightning-rl are comparing it to the libraries listed below:
- Creating and sharing simulation environments for embodied and synthetic data research ☆191 · Updated last year
- WandB sweeps integration with Hydra sweeper ☆47 · Updated last year
- ☆43 · Updated last year
- Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped… ☆16 · Updated 3 weeks ago
- DiffusionWithAutoscaler ☆29 · Updated 10 months ago
- A langchain agent that retries ☆48 · Updated last year
- Tutorial to get started with SkyPilot! ☆56 · Updated 9 months ago
- The backend behind the LLM-Perf Leaderboard ☆10 · Updated 9 months ago
- Because we don't have enough time to read everything ☆87 · Updated 4 months ago
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO ☆57 · Updated 4 months ago
- An active learning library for Pytorch based on Lightning-Fabric. ☆79 · Updated 9 months ago
- AI Data Management & Evaluation Platform ☆215 · Updated last year
- Exca - Execution and caching tool for python ☆75 · Updated this week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆31 · Updated last year
- ☆20 · Updated 2 years ago
- Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL) ☆80 · Updated last year
- On-the-fly conversions between Jax and NumPy tensors ☆49 · Updated last year
- Code for minimum-entropy coupling. ☆31 · Updated 7 months ago
- Repository to gather and share ideas eventually spurring discussions and possibly implementations. Please adhere to the proposal template… ☆9 · Updated 5 years ago
- Additional code for Stable-baselines3 to load and upload models from the Hub. ☆84 · Updated 7 months ago
- ☆100 · Updated last year
- Discovering Data-driven Hypotheses in the Wild ☆55 · Updated 3 months ago
- Train very large language models in Jax. ☆202 · Updated last year
- Portfolio REgret for Confidence SEquences ☆14 · Updated 2 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆161 · Updated last week
- LMQL implementation of tree of thoughts ☆33 · Updated last year
- Efficient baselines for autocurricula in JAX. ☆179 · Updated 5 months ago
- ☆14 · Updated 10 months ago
- A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management ☆121 · Updated 7 months ago