A curated list of reinforcement learning (RL) for agents.
☆97Mar 30, 2026Updated 2 months ago
Alternatives and similar repositories for awesome-rl-for-agents
Users that are interested in awesome-rl-for-agents are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆26Jan 26, 2024Updated 2 years ago
- [IEEE TIP 2024] Facial Prior Guided Micro-Expression Generation☆13Nov 8, 2024Updated last year
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆40Aug 15, 2025Updated 9 months ago
- Implementation of the paper: ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification☆13Aug 31, 2024Updated last year
- ☆47Mar 15, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆74Mar 18, 2026Updated 2 months ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Jan 5, 2025Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- [ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dial…☆29Apr 1, 2026Updated last month
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 8 months ago
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- Application for detecting command and control (C2) communication through network traffic analysis.☆16May 12, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [TRIT 2024] Implementation of the paper “Explore Human Parsing Modality for Action Recognition”.☆39Aug 26, 2024Updated last year
- ☆30Mar 11, 2025Updated last year
- [ACL 2026] OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems☆125May 12, 2026Updated 2 weeks ago
- ☆10Apr 26, 2023Updated 3 years ago
- DCPO: Dynamic Adaptive Clipping for RL☆49Apr 1, 2026Updated last month
- ☆513Oct 11, 2025Updated 7 months ago
- HARPER is a HRI dataset for 3D Human Pose Estimation and Forecasting from the Robot’s Perspective.☆13Sep 2, 2025Updated 8 months ago
- PyTorch code of “Out-of-Sample Representation Learning for Multi-Relational Graphs” (EMNLP 2020)☆10Oct 2, 2020Updated 5 years ago
- Another Wheel to parse json☆11Mar 13, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A heuristic, python-based detector for fast-flux botnets.☆13Feb 24, 2012Updated 14 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- Implementations of modern convex optimization-based graph algorithms in Python. Available on the Python Package Index (PyPI).☆16Jul 18, 2019Updated 6 years ago
- 用触动精灵lua脚本刷各种广告,由后台下发个广告SDK任务,支持23种分辨率,支持水军,留存上报,留存可后台设置百分比,10多个平台广告同时刷,实时上报刷量数据 本脚本建议配合盘古后台系统,fakeapk hook 工具使用。☆13Mar 30, 2018Updated 8 years ago
- ☆11Dec 6, 2020Updated 5 years ago
- A collection of papers and libraries for performing multi-agent optimization☆19Feb 7, 2026Updated 3 months ago
- ☆55Apr 7, 2026Updated last month
- Materials for paper "Are Large Language Models Temporally Grounded?"☆14Nov 16, 2023Updated 2 years ago
- ☆13Jan 7, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Multiview variant of Pointpillars. Contains Pytorch reimplementation of Pillar-od.☆14Jan 15, 2021Updated 5 years ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆44Apr 16, 2026Updated last month
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆24Mar 8, 2026Updated 2 months ago
- Static code injection using text padding and reverse text extension☆11Jun 7, 2017Updated 8 years ago
- 北语 246 实验室新生简明指南☆10May 30, 2022Updated 3 years ago
- Forecastbench Datasets, updated nightly☆28May 21, 2026Updated last week
- ☆14Dec 9, 2022Updated 3 years ago