yuzhu-cai/rSDE-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yuzhu-cai/rSDE-Bench)

yuzhu-cai / rSDE-Bench

☆36

Alternatives and similar repositories for rSDE-Bench

Users that are interested in rSDE-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EthanLeo-LYX / BiDeV
View on GitHub
[AAAI2025 Oral] BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking
☆15Apr 22, 2025Updated last year
youngsoul0731 / FLORA-Bench
View on GitHub
[Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances
☆20Jan 15, 2026Updated 6 months ago
xbmxb / EnvDistraction
View on GitHub
☆24Oct 11, 2024Updated last year
MASWorks / MASLab
View on GitHub
☆243Jul 25, 2025Updated last year
hengzzzhou / ReSo
View on GitHub
☆25Jan 29, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tsinghua-fib-lab / SmartAgent
View on GitHub
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27Updated this week
Open-Social-World / autolibra
View on GitHub
AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback
☆19Apr 23, 2026Updated 3 months ago
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year
DorothyDUUU / SWE-Dev
View on GitHub
Official code space for "SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development"
☆62Oct 24, 2025Updated 9 months ago
RUCAIBox / OlymMATH
View on GitHub
The OlymMATH dataset
☆24Jun 1, 2025Updated last year
zz-haooo / LLMs-Preference-Optimization
View on GitHub
☆18May 31, 2024Updated 2 years ago
vast-ai / vast-pyworker
View on GitHub
☆12May 20, 2025Updated last year
sjtu-sai-agents / DataMaster
View on GitHub
official code for DataMaster
☆47May 17, 2026Updated 2 months ago
sjtu-sai-agents / MagiClaw
View on GitHub
MagiClaw: Conversational Command Center for Your Scientific Agent Team.
☆130Apr 16, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
seketeam / EvoCodeBench
View on GitHub
An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories
☆71Aug 15, 2024Updated last year
yanweiyue / GDesigner
View on GitHub
☆97Dec 5, 2024Updated last year
taco-group / COCMT
View on GitHub
[IROS'25] COCMT
☆12Aug 14, 2025Updated 11 months ago
bingreeky / MaAS
View on GitHub
[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet
☆279Nov 13, 2025Updated 8 months ago
euReKa025 / AgentLongBench
View on GitHub
☆22Jan 29, 2026Updated 6 months ago
DorothyDUUU / Info-Mosaic
View on GitHub
[ICLR 2026] InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents
☆128Feb 5, 2026Updated 5 months ago
LINs-lab / SupervisorAgent
View on GitHub
[ICLR 2026] Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems
☆32Mar 2, 2026Updated 4 months ago
JungHoyoun / PromptCompressor
View on GitHub
☆12Apr 29, 2024Updated 2 years ago
portal-cornell / muCode
View on GitHub
☆33Oct 2, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
richardzhuang0412 / EmbedLLM
View on GitHub
Repo for EmbedLLM: Learning Compact Representations of Large Language Models
☆32Sep 25, 2025Updated 10 months ago
ShuoTang123 / MATRIX
View on GitHub
Implementation of the MATRIX framework (ICML 2024)
☆60May 6, 2024Updated 2 years ago
mahimanzum / FixEval
View on GitHub
We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…
☆26Aug 31, 2022Updated 3 years ago
SinHanYang / Dual-CAN
View on GitHub
Entity-Aware Dual Co-Attention Network for Fake News Detection, EACL 2023 Findings
☆10Jun 11, 2023Updated 3 years ago
Tim-Siu / reinforcement-distillation
View on GitHub
Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
☆33Jul 25, 2025Updated last year
mizvol / gephi-tutorials
View on GitHub
Gephi tutorials for data visualisation lecture. A Network Tour of Data Science 2019 Fall semester
☆12Apr 11, 2021Updated 5 years ago
MultiagentBench / MARBLE
View on GitHub
(ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…
☆54Jun 21, 2025Updated last year
AlbertChen1991 / nEM
View on GitHub
Code and data for EMNLP2019 Paper "Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework…
☆10May 24, 2020Updated 6 years ago
ACADLab / SA-DS
View on GitHub
☆15Jul 25, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
OpenDevin / OD-SWE-bench
View on GitHub
Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.
☆30May 26, 2024Updated 2 years ago
tanzelin430 / The-Scaling-Law-for-Reinforcement-Learning
View on GitHub
[ACL2026]Code Repo for paper "Scaling Behaviors of LLM Reinforcement Learning Post-Training"
☆24Jul 1, 2026Updated 3 weeks ago
seketeam / group-meeting-slides
View on GitHub
☆14Jun 3, 2025Updated last year
StefanHeng / ProgGen
View on GitHub
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17Mar 29, 2024Updated 2 years ago
cavaunpeu / mcts-llm-codegen
View on GitHub
A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)
☆17Dec 1, 2023Updated 2 years ago
lblankl / Short-RL
View on GitHub
Short RL
☆19Apr 16, 2026Updated 3 months ago
Kangningthu / SUM
View on GitHub
Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).
☆16Jan 9, 2025Updated last year