A curated list of RL resources
☆54Apr 17, 2026Updated last month
Alternatives and similar repositories for Awesome-RL
Users that are interested in Awesome-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Kernel Herding for probability density estimation☆14Feb 23, 2016Updated 10 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆24Apr 26, 2025Updated last year
- Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics [PyTorch, SO-101 Robot Arm, ManiSkill3, Sim-to-Real]☆60Mar 4, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is the source code for: Context-aware Entity Typing in Knowledge Graphs.☆16May 10, 2022Updated 4 years ago
- ☆14Apr 21, 2023Updated 3 years ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated 2 months ago
- ☆18May 5, 2021Updated 5 years ago
- ☆29Mar 13, 2026Updated 2 months ago
- ☆59Nov 18, 2024Updated last year
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆25Apr 6, 2026Updated last month
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a training-free approach to accelerate ViTs and VLMs by pruning redundant tokens based on similarity☆44May 24, 2025Updated last year
- Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts☆25Feb 23, 2024Updated 2 years ago
- Knowledge Base Graph Attention Networks☆14Feb 22, 2020Updated 6 years ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated 11 months ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆23Aug 26, 2024Updated last year
- MICCAI 2024 code for the paper: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing. EchoNet-Synthetic i…☆39Jun 16, 2025Updated 11 months ago
- [ICLR 2025] Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate☆21Apr 22, 2025Updated last year
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".☆37Dec 22, 2024Updated last year
- data center cooling with reinforcement learning☆16Jun 3, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ChatGPT for Software Architects, Harnessing AI for Architectural Innovation using Intelligent Design Decision Support☆38Apr 1, 2026Updated last month
- Controller for OnRobot RG2 and RG6 grippers.☆21Mar 10, 2026Updated 2 months ago
- Setup guide for the UniTree Go1 robot☆23Dec 9, 2023Updated 2 years ago
- Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"☆83Oct 29, 2025Updated 6 months ago
- This repository provides a framework for low-level control of a legged robot (Unitree Go2), using ROS 2 as the communication middleware. …☆34Oct 17, 2025Updated 7 months ago
- ☆90Aug 16, 2025Updated 9 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆41Feb 27, 2024Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Feb 21, 2022Updated 4 years ago
- Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]☆21Aug 13, 2025Updated 9 months ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19May 14, 2026Updated last week
- ☆35Sep 14, 2024Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆50Nov 27, 2025Updated 6 months ago
- MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis☆19Jun 13, 2025Updated 11 months ago