☆63Jun 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for UserBench
Users that are interested in UserBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The raw UserRL repo under construction☆104Jun 2, 2026Updated 3 weeks ago
- ☆20Nov 3, 2024Updated last year
- Functional Optimal Transport: Map Estimation and Domain Adaptation for Functional data☆28Jun 7, 2021Updated 5 years ago
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆27Oct 23, 2024Updated last year
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆14Aug 25, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 9 months ago
- [ICLR 2026] Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆129Feb 2, 2026Updated 4 months ago
- EB1A DIY Collection☆18Jun 8, 2026Updated 3 weeks ago
- Feasibility Consistent Representation Learning for Safe Reinforcement Learning (ICML 2024). Current SOTA model-free safe RL algorithm on …☆17Jul 12, 2024Updated last year
- ☆21Jan 5, 2025Updated last year
- Train and visualise a latent variable model of moving objects.☆16Apr 28, 2020Updated 6 years ago
- Efficient Scaling laws and collaborative pretraining.☆22Sep 18, 2025Updated 9 months ago
- ☆34Jan 25, 2026Updated 5 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆35Mar 26, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Aug 5, 2024Updated last year
- ☆84May 14, 2026Updated last month
- Source code repository for our EMNLP paper on cross-domain claim identification☆14Oct 24, 2018Updated 7 years ago
- The Wasserstein Distance and Optimal Transport Map of Gaussian Processes☆52Aug 3, 2020Updated 5 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆98Mar 20, 2026Updated 3 months ago
- ☆89May 24, 2026Updated last month
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆71Jun 13, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🍎Wende Chinese QA system (experimental)☆10Jun 1, 2021Updated 5 years ago
- ☆43Feb 27, 2026Updated 4 months ago
- ☆11May 29, 2025Updated last year
- 本项目是July的《程序员编程艺术》的电子书版本☆10Jan 9, 2014Updated 12 years ago
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆67Jul 28, 2025Updated 11 months ago
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆135May 2, 2026Updated last month
- ☆46Oct 1, 2024Updated last year
- ☆27Oct 27, 2025Updated 8 months ago
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Feb 6, 2021Updated 5 years ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆60Feb 6, 2026Updated 4 months ago
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 3 years ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆88Jan 12, 2025Updated last year
- ☆22Nov 5, 2024Updated last year
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆92Jan 21, 2026Updated 5 months ago
- Code repo for FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.☆31Nov 5, 2025Updated 7 months ago