facebookresearch / PearlLinks

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

☆2,879

Alternatives and similar repositories for Pearl

Users that are interested in Pearl are comparing it to the libraries listed below

Sorting:

pytorch / rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
☆2,911Updated this week
KhoomeiK / LlamaGym
Fine-tune LLM agents with online reinforcement learning
☆1,201Updated last year
AgileRL / AgileRL
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h…
☆793Updated last week
vwxyzjn / cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, T…
☆7,406Updated last week
google-deepmind / funsearch
☆898Updated last year
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,392Updated last year
kzl / decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
☆2,606Updated last year
danijar / dreamerv3
Mastering Diverse Domains through World Models
☆1,980Updated 3 months ago
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,260Updated this week
pytorch-labs / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,011Updated 3 months ago
google-deepmind / mctx
Monte Carlo tree search in JAX
☆2,507Updated 3 months ago
pytorch / torchtune
PyTorch native post-training library
☆5,323Updated this week
luchris429 / purejaxrl
Really Fast End-to-End Jax RL Implementations
☆908Updated 10 months ago
facebookresearch / jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
☆3,126Updated 4 months ago
google-deepmind / concordia
A library for generative social simulation
☆928Updated last week
eureka-research / Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
☆3,011Updated last year
DLR-RM / rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents include…
☆2,483Updated last month
allenai / RL4LMs
A modular RL library to fine-tune language models to human preferences
☆2,329Updated last year
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,674Updated last year
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆6,927Updated last year
HumanCompatibleAI / imitation
Clean PyTorch implementations of imitation and reward learning algorithms
☆1,532Updated 6 months ago
openai / transformer-debugger
☆4,083Updated last year
alex-petrenko / sample-factory
High throughput synchronous and asynchronous reinforcement learning
☆921Updated last month
pytorch-labs / LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
☆602Updated 8 months ago
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,490Updated 8 months ago
google-deepmind / acme
A library of reinforcement learning components and agents
☆3,729Updated last month
google-deepmind / penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
☆1,803Updated 3 weeks ago
SqueezeAILab / LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
☆1,712Updated last year
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,774Updated 3 weeks ago
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,172Updated last year