☆55Aug 5, 2025Updated 6 months ago
Alternatives and similar repositories for UserBench
Users that are interested in UserBench are comparing it to the libraries listed below
Sorting:
- The raw UserRL repo under construction☆94Sep 25, 2025Updated 5 months ago
- ☆20Nov 3, 2024Updated last year
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆26Oct 23, 2024Updated last year
- Functional Optimal Transport: Map Estimation and Domain Adaptation for Functional data☆27Jun 7, 2021Updated 4 years ago
- ☆16Jan 5, 2025Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Efficient Scaling laws and collaborative pretraining.☆21Sep 18, 2025Updated 5 months ago
- Feasibility Consistent Representation Learning for Safe Reinforcement Learning (ICML 2024). Current SOTA model-free safe RL algorithm on …☆13Jul 12, 2024Updated last year
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆14Aug 25, 2023Updated 2 years ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Nov 7, 2025Updated 3 months ago
- Train and visualise a latent variable model of moving objects.☆16Apr 28, 2020Updated 5 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆38Feb 27, 2024Updated 2 years ago
- ☆21Nov 5, 2024Updated last year
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 5 months ago
- ☆58Jan 19, 2025Updated last year
- An Open-Source Reinforcement Learning Framework for Robot-Task Environments☆27Jul 6, 2023Updated 2 years ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆85Jan 21, 2026Updated last month
- ☆77Nov 6, 2025Updated 3 months ago
- The Wasserstein Distance and Optimal Transport Map of Gaussian Processes☆52Aug 3, 2020Updated 5 years ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆59Feb 6, 2026Updated 3 weeks ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 7 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer (ICML 2022 Long Oral)☆26Sep 10, 2022Updated 3 years ago
- ☆30Sep 28, 2023Updated 2 years ago
- ☆46Sep 27, 2025Updated 5 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated 9 months ago
- A library for minimizing the effects of confounding covariates☆15May 28, 2025Updated 9 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated last month
- [CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling☆191Updated this week
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆84Jan 12, 2025Updated last year
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R…☆35Jan 25, 2023Updated 3 years ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆30Updated this week
- multicast learning in network programming course☆10Oct 30, 2020Updated 5 years ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 4 months ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆29Oct 23, 2025Updated 4 months ago
- ☆11Jun 22, 2025Updated 8 months ago