SalesforceAIResearch/UserRL

The raw UserRL repo under construction

☆110

Alternatives and similar repositories for UserRL

Users who are interested in UserRL are comparing it to the libraries listed below.

SalesforceAIResearch / UserBench
☆63 · Jun 2, 2026 · Updated last month
chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
zzwkk / MUA-RL
MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE
☆65 · Nov 5, 2025 · Updated 8 months ago
stellalisy / PrefPalette
☆21 · Apr 3, 2026 · Updated 3 months ago
sunnweiwei / PPP-Agent
Training Proactive and Personalized LLM Agents
☆112 · Jan 20, 2026 · Updated 6 months ago
pearls-lab / meow-tea-taro
A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
☆83 · Jan 16, 2026 · Updated 6 months ago
zhiyuan-zhang0206 / HomeworkAgent
A multi-agent framework to help with your homework.
☆11 · Mar 1, 2025 · Updated last year
lili-chen / rltf
Reinforcement Learning from Text Feedback
☆49 · Feb 17, 2026 · Updated 5 months ago
bin123apple / InfantAgent
[NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.
☆39 · Apr 23, 2026 · Updated 2 months ago
facebookresearch / meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…
☆528 · Jun 20, 2026 · Updated last month
OPPO-PersonalAI / PersonalizedDeepResearchBench
☆24 · Jan 27, 2026 · Updated 5 months ago
AgenticIR-Lab / OThink-R1
This is the official code for OThink-R1 project.
☆21 · Jun 19, 2025 · Updated last year
TIGER-AI-Lab / Hierarchical-Reasoner
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]
☆64 · Apr 11, 2026 · Updated 3 months ago
allenai / olmix
☆41 · May 26, 2026 · Updated last month
czp16 / Bridge-LLM-reasoning
Behavior Injection: Preparing Language Models for Reinforcement Learning (NeurIPS 2025)
☆17 · Jul 1, 2025 · Updated last year
upup-wei / RAG-ReasonAlignment
☆20 · May 20, 2025 · Updated last year
howard-yen / SLIM
☆27 · Jun 22, 2026 · Updated 3 weeks ago
zaydzuhri / token-order-prediction
Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
☆48 · May 13, 2026 · Updated 2 months ago
FoundationAgents / InteractComp
☆22 · Jan 26, 2026 · Updated 5 months ago
JiazhengZhang / AgentV-RL
☆15 · Apr 17, 2026 · Updated 3 months ago
RM-R1-UIUC / RM-R1
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167 · Jun 26, 2025 · Updated last year
cxcscmu / General-AgentBench
Benchmark Test-Time Scaling of General LLM Agents
☆20 · Apr 14, 2026 · Updated 3 months ago
Farseer-Scaling-Law / Farseer
☆21 · Jun 12, 2025 · Updated last year
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Updated this week
RUC-NLPIR / ARPO
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,088 · Jul 13, 2026 · Updated last week
inclusionAI / GroveMoE
☆24 · Aug 20, 2025 · Updated 11 months ago
TsinghuaC3I / SSRL
SSRL: Self-Search Reinforcement Learning
☆210 · Aug 20, 2025 · Updated 11 months ago
qiancheng0 / ModelingAgent
☆23 · Sep 7, 2025 · Updated 10 months ago
wizard-III / ArcherCodeR
ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement …
☆44 · Aug 6, 2025 · Updated 11 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated 11 months ago
xlang-ai / OSWorld-G
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172 · Jun 18, 2026 · Updated last month
kfq20 / Coleaf
AI workspace for collaborative Overleaf/LaTeX writing
☆19 · May 7, 2026 · Updated 2 months ago
xlang-ai / CUA-Gym
Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents
☆177 · May 26, 2026 · Updated last month
bowen-upenn / PersonaMem
[COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale
☆172 · Mar 19, 2026 · Updated 4 months ago
TIGER-AI-Lab / AceCoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆100 · Apr 9, 2025 · Updated last year
UMass-Embodied-AGI / BudgetGuidance
[ACL'26 Findings] Steering LLM Thinking with Budget Guidance
☆32 · Feb 19, 2026 · Updated 5 months ago
openaiotlab / ContextAgent
[NeurIPS'25] ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions
☆48 · Dec 7, 2025 · Updated 7 months ago
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆30 · Aug 26, 2025 · Updated 10 months ago
wssun / PromptCS
A Prompt Learning Framework for Source Code Summarization
☆14 · Dec 26, 2023 · Updated 2 years ago