CUHK-Shenzhen-SE/UTBoost

[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

☆36

Alternatives and similar repositories for UTBoost

Users that are interested in UTBoost are comparing it to the libraries listed below.

Susan571 / LENSLLM
This repository contains the code for our ICML 2025 paper, LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection 🎉
☆26 · May 29, 2025 · Updated last year
CUHK-Shenzhen-SE / RetromorphicTesting
☆11 · Jan 19, 2025 · Updated last year
ltlhuuu / PSEC
[ICLR 2025] The official implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…
☆65 · Feb 12, 2025 · Updated last year
PrismaX-Team / PhysUniBenchmark
☆20 · Nov 27, 2025 · Updated 8 months ago
xyliu-cs / RISE
[NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)
☆33 · Aug 8, 2025 · Updated 11 months ago
GiantAILab / DeepSound-V1
Official code for DeepSound-V1
☆12 · May 14, 2025 · Updated last year
OpenAgentEval / SWE-ABS
[ICML 2026] SWE-ABS: Adversarial Benchmark Strengthening Exposes Inflated Success Rates on Test-based Benchmark
☆22 · May 6, 2026 · Updated 2 months ago
Qiukunpeng / Siamese-Diffusion
[CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
☆90 · Nov 29, 2025 · Updated 7 months ago
CUHK-Shenzhen-SE / D4C
[ICSE'25] Aligning the Objective of LLM-based Program Repair
☆24 · Mar 8, 2025 · Updated last year
QiangweiPeng / stVCR
Reconstructing spatiotemporal dynamics from spatial transcriptome snapshots
☆64 · Apr 12, 2026 · Updated 3 months ago
LGU-SE-Internal / GRev
A lightweight tool for detecting bugs on Graph Database Management Systems
☆15 · Jan 9, 2024 · Updated 2 years ago
GiantAILab / DeepDubber-V1
DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…
☆30 · Sep 7, 2025 · Updated 10 months ago
OrangeSodahub / InfGen
[ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.
☆53 · Aug 27, 2025 · Updated 11 months ago
AuroraZengfh / Local-Prompt
[ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection
☆52 · Jul 30, 2025 · Updated 11 months ago
YutingLi0606 / Vision-Matters
(ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
☆60 · Sep 30, 2025 · Updated 9 months ago
inFaaa / Awesome-Personalized-Video-Creation
📖 This is a repository for organizing papers, code, and other resources related to personalized video generation and editing.
☆64 · Dec 9, 2025 · Updated 7 months ago
logpai / hybridlogparser
A toolkit for hybrid log parsing
☆18 · Aug 23, 2023 · Updated 2 years ago
fscdc / ReasonMap
[CVPR 2026] ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
☆87 · Updated this week
Tim-Siu / reinforcement-distillation
Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"
☆33 · Jul 25, 2025 · Updated last year
ZhishanQ / UniHGKR
The official repository of UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers
☆27 · Jun 12, 2025 · Updated last year
Siyuexi / Hue
[ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs
☆10 · Aug 24, 2023 · Updated 2 years ago
zstsandy / Awesome-3D-Gaussian-Splatting-in-Robotics
☆118 · Feb 13, 2025 · Updated last year
RainBowLuoCS / OpenOmni
(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…
☆142 · May 9, 2026 · Updated 2 months ago
DataArcTech / SQL-R1
[NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"
☆145 · Nov 20, 2025 · Updated 8 months ago
PacificStudio / openase
Ticket-Driven Automated Software Engineering. OpenASE is an all-in-one platform that turns tickets into working code — AI agents automati…
☆263 · Updated this week
ZhonghaoJiang / CoSIL
[ASE 2025] CoSIL: Issue Localization via Iterative Code Graph Searching
☆24 · May 31, 2026 · Updated last month
iie-ycx / DEER
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
☆199 · Jul 7, 2025 · Updated last year
RobustNLP / DeRTa
A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.
☆72 · May 22, 2025 · Updated last year
liangyuwang / zo2
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]
☆208 · Jul 16, 2025 · Updated last year
murphylmf / M-LRM
☆94 · Mar 19, 2025 · Updated last year
GiantAILab / DeepAudio-V1
☆17 · May 13, 2025 · Updated last year
upup-wei / RAG-ReasonAlignment
☆20 · May 20, 2025 · Updated last year
NuoJohnChen / XtraGPT
[ACL 2026 Main] XtraGPT: Context-Aware and Controllable Academic Paper Revision via Human-AI Collaboration
☆25 · Apr 23, 2026 · Updated 3 months ago
SQD1 / RESTORE-DiT
[RSE 2025] RESTORE-DiT: Reliable satellite image time series reconstruction by multimodal sequential diffusion transformer
☆60 · Jul 7, 2025 · Updated last year
rekkles2 / Fed_WSVAD
[IEEE TII 2025] Official Implementation for "Dual-Detector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Ada…
☆27 · Nov 11, 2025 · Updated 8 months ago
NiceRingNode / Awesome-Generative-Models-for-OCR
[arXiv 25] OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities
☆273 · Apr 13, 2026 · Updated 3 months ago
ModelTC / HarmoniCa
[ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…
☆46 · Jul 10, 2025 · Updated last year
fscdc / Awesome-Efficient-Reasoning-Models
[TMLR 2025] Efficient Reasoning Models: A Survey
☆317 · Jun 26, 2026 · Updated last month
BIT-DA / ABS
[ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection
☆27 · Jun 27, 2025 · Updated last year