A curated list of RL resources
☆53Apr 17, 2026Updated 2 weeks ago
Alternatives and similar repositories for Awesome-RL
Users that are interested in Awesome-RL are comparing it to the libraries listed below.
- The source code of [WWW 2025] MoDiCF☆14Mar 26, 2026Updated last month
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Kernel Herding for probability density estimation☆14Feb 23, 2016Updated 10 years ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 9 months ago
- Official implementation of "Advancing Spiking Neural Networks towards Deep Residual Learning" (IEEE TNNLS 2024)☆14Aug 28, 2023Updated 2 years ago
- This is the source code for: Context-aware Entity Typing in Knowledge Graphs.☆16May 10, 2022Updated 3 years ago
- ☆14Apr 21, 2023Updated 3 years ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆27Mar 9, 2026Updated last month
- ☆18May 5, 2021Updated 5 years ago
- ☆28Jul 11, 2024Updated last year
- [NeurIPS 2025] MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks☆51Mar 13, 2026Updated last month
- FeatureAlignment = Alignment + Mechanistic Interpretability☆35Mar 8, 2025Updated last year
- ☆29Mar 13, 2026Updated last month
- ☆59Nov 18, 2024Updated last year
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- Gazebo Classic Simulation of Universal Robot + Robotiq 2f-85.☆13Feb 11, 2025Updated last year
- A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)☆24Jul 26, 2024Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
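- This toolkit takes a supplied package of entity classes within a project and automatically generates what Spring MyBatis needs: the service-layer interfaces and implementations, the database tables (including primary keys, descriptions, lengths, and similar settings), and the database access interfaces with their corresponding XML files. Supports jar and Maven.☆10Dec 23, 2022Updated 3 years ago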
- ☆12Apr 21, 2024Updated 2 years ago
- Official implementation of "Inherent Redundancy in Spiking Neural Networks" (ICCV2023)☆29Jan 7, 2024Updated 2 years ago
- ☆25May 29, 2022Updated 3 years ago
- ☆13Jan 14, 2026Updated 3 months ago
- Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts☆25Feb 23, 2024Updated 2 years ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆25May 27, 2025Updated 11 months ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆23Aug 26, 2024Updated last year
- Example of collecting data from SRanipal at 120hz with HTC Vive Pro Eye☆15Dec 1, 2022Updated 3 years ago
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".☆37Dec 22, 2024Updated last year
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 6 months ago
- Companion code for the book 《基于BERT模型的自然语言处理实战》 (Natural Language Processing in Practice with BERT Models)☆17Jun 13, 2022Updated 3 years ago
- The pre-release GitHub repository of ECHOPULSE: ECG CONTROLLED ECHOCARDIOGRAMS VIDEO GENERATION☆43Feb 4, 2025Updated last year
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆20Mar 13, 2025Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ChatGPT for Software Architects, Harnessing AI for Architectural Innovation using Intelligent Design Decision Support☆38Apr 1, 2026Updated last month
- Official repository Flash Local Linear Attention☆23Apr 23, 2026Updated 2 weeks ago
- Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"☆83Oct 29, 2025Updated 6 months ago
- Repository for the paper "PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- ☆90Aug 16, 2025Updated 8 months ago