facebookresearch / Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
☆2,672 Updated this week
Related projects
Alternatives and complementary repositories for Pearl
- A modular, primitive-first, python-first PyTorch library for Reinforcement Learning. ☆2,346 Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,679 Updated this week
- Fine-tune LLM agents with online reinforcement learning ☆995 Updated 8 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,336 Updated 7 months ago
- ☆4,035 Updated 5 months ago
- High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T… ☆5,706 Updated this week
- PyTorch native finetuning library ☆4,336 Updated this week
- A native PyTorch Library for large model training ☆2,623 Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆5,669 Updated last month
- ☆741 Updated 9 months ago
- Tools for merging pretrained large language models. ☆4,816 Updated 2 weeks ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆445 Updated 3 weeks ago
- Really Fast End-to-End Jax RL Implementations ☆724 Updated 2 months ago
- Schedule-Free Optimization in PyTorch ☆1,898 Updated 2 weeks ago
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools. ☆592 Updated this week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,199 Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,045 Updated this week
- A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents include… ☆2,081 Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆916 Updated 2 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,057 Updated 2 months ago
- ☆2,746 Updated 2 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,529 Updated 4 months ago
- nanoGPT style version of Llama 3.1 ☆1,246 Updated 3 months ago
- A curated list of reinforcement learning with human feedback resources (continually updated) ☆3,476 Updated last week
- Monte Carlo tree search in JAX ☆2,357 Updated 3 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆2,673 Updated 3 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,195 Updated 4 months ago
- A modern model graph visualizer and debugger ☆1,058 Updated this week
- A library for generative social simulation ☆693 Updated this week
- Mastering Diverse Domains through World Models ☆1,381 Updated 3 months ago