amazon-science/PAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amazon-science/PAE)

amazon-science / PAE

☆70

Alternatives and similar repositories for PAE

Users that are interested in PAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DigiRL-agent / digiq
View on GitHub
☆121Apr 8, 2025Updated last year
ZrW00 / GraCeFul
View on GitHub
The code implementation of GraCeFul (Accepted in COLING 2025)
☆13Jan 27, 2025Updated last year
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
test-time-interaction / TTI
View on GitHub
☆76Jun 10, 2025Updated last year
data-for-agents / insta
View on GitHub
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆56Jul 11, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MindLab-Research / longstraw
View on GitHub
MinT-2M: Long-context training system for resident-prefix GRPO
☆17Updated this week
cocoa-org / NanoRollout
View on GitHub
Scale digital agent rollouts without pain.
☆34Jun 18, 2026Updated last month
hkust-nlp / GUIMid
View on GitHub
☆22May 3, 2025Updated last year
vivekmyers / tra-ogbench
View on GitHub
☆18Feb 13, 2025Updated last year
ChristosKap / policy_consolidation
View on GitHub
Code for Policy Consolidation for Continual Reinforcement Learning
☆10May 12, 2019Updated 7 years ago
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
yjyddq / EOSER-ASS-RL
View on GitHub
Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…
☆28Mar 9, 2026Updated 4 months ago
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
YifeiZhou02 / ArCHer
View on GitHub
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆208Apr 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Berkeley-NLP / Agent-Eval-Refine
View on GitHub
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆149Nov 26, 2024Updated last year
chentong0 / rl-binary-rar
View on GitHub
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15Nov 13, 2025Updated 8 months ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Updated this week
wumingqi / LLM-Math-Evaluation
View on GitHub
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.
☆21Jul 18, 2025Updated last year
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
zhxieml / PDT
View on GitHub
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
☆29Jul 25, 2023Updated 2 years ago
THUDM / WebRL
View on GitHub
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆535Jun 6, 2025Updated last year
sunblaze-ucb / reasoning_ladder
View on GitHub
☆35May 16, 2025Updated last year
mathllm / Step-Controlled_DPO
View on GitHub
☆23Jul 5, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
WujiangXu / EPO
View on GitHub
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆40Jul 13, 2026Updated last week
RL4VLM / RL4VLM
View on GitHub
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆415Dec 15, 2024Updated last year
kyle8581 / Web-Shepherd
View on GitHub
[NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"
☆58May 21, 2025Updated last year
OpenWebRL / OpenWebRL
View on GitHub
Code for paper OpenWebRL: Online Multi-Turn Reinforcement Learning for Visual Web Agents
☆37Jul 9, 2026Updated last week
RTkenny / RiskPO
View on GitHub
Official implementation of 'RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training', accepted by ICLR 2026
☆18Oct 15, 2025Updated 9 months ago
google-deepmind / dmc_vision_benchmark
View on GitHub
☆34Jun 21, 2024Updated 2 years ago
floriangroetschla / AgentsNet
View on GitHub
☆37Jul 16, 2025Updated last year
Jikai0Wang / OPT-Tree
View on GitHub
☆30May 24, 2025Updated last year
thu-coai / SPaR
View on GitHub
☆47Jun 11, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hulianyuyy / iLLaVA
View on GitHub
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)
☆23Jun 24, 2026Updated 3 weeks ago
jylee425 / b-moca
View on GitHub
Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)
☆34Jul 21, 2025Updated last year
hangeol / UniR
View on GitHub
Official repo for paper: Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
☆20Nov 26, 2025Updated 7 months ago
convergence-ai / webgames
View on GitHub
Challenges for general-purpose web-browsing AI agents
☆68Jun 2, 2025Updated last year
microsoft / webgym
View on GitHub
This project includes code for using the AsyncWebRL and WebGym frameworks to train web agent models.
☆46Jun 9, 2026Updated last month
cheryyunl / Make-An-Agent
View on GitHub
☆51Jul 22, 2024Updated last year
LinWeizheDragon / Knowledge-Aware-Graph-Enhanced-GPT-2-for-Dialogue-State-Tracking
View on GitHub
This is the official repository of EMNLP 2021 paper "Knowledge-Aware Graph-Enhanced GPT-2 for Dialogue State Tracking".
☆24Nov 11, 2021Updated 4 years ago