facebookresearch / Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
☆2,879 Updated last week
Alternatives and similar repositories for Pearl
Users who are interested in Pearl are comparing it to the libraries listed below.
- A modular, primitive-first, python-first PyTorch library for Reinforcement Learning. ☆2,911 Updated this week
- Fine-tune LLM agents with online reinforcement learning ☆1,201 Updated last year
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h… ☆793 Updated last week
- High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T… ☆7,406 Updated last week
- ☆898 Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,392 Updated last year
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. ☆2,606 Updated last year
- Mastering Diverse Domains through World Models ☆1,980 Updated 3 months ago
- Robust recipes to align language models with human and AI preferences ☆5,260 Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,011 Updated 3 months ago
- Monte Carlo tree search in JAX ☆2,507 Updated 3 months ago
- PyTorch native post-training library ☆5,323 Updated this week
- Really Fast End-to-End Jax RL Implementations ☆908 Updated 10 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,126 Updated 4 months ago
- A library for generative social simulation ☆928 Updated last week
- Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024) ☆3,011 Updated last year
- A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents include… ☆2,483 Updated last month
- A modular RL library to fine-tune language models to human preferences ☆2,329 Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,674 Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ☆6,927 Updated last year
- Clean PyTorch implementations of imitation and reward learning algorithms ☆1,532 Updated 6 months ago
- ☆4,083 Updated last year
- High throughput synchronous and asynchronous reinforcement learning ☆921 Updated last month
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆602 Updated 8 months ago
- Training LLMs with QLoRA + FSDP ☆1,490 Updated 8 months ago
- A library of reinforcement learning components and agents ☆3,729 Updated last month
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,803 Updated 3 weeks ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,712 Updated last year
- NanoGPT (124M) in 3 minutes ☆2,774 Updated 3 weeks ago
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,172 Updated last year