alibaba/terminal-bench-pro

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibaba/terminal-bench-pro)

alibaba / terminal-bench-pro

☆119

Alternatives and similar repositories for terminal-bench-pro

Users that are interested in terminal-bench-pro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

harbor-framework / terminal-bench-challenges
View on GitHub
☆18Jun 18, 2026Updated 3 weeks ago
abundant-ai / SWE-gen
View on GitHub
Convert GitHub PRs into Harbor tasks
☆69Updated this week
panilya / awesome-ai-benchmarks
View on GitHub
Awesome AI Benchmarks
☆37Updated this week
facebookresearch / BigOBench
View on GitHub
BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…
☆43Apr 15, 2025Updated last year
bhanML / SIGUA
View on GitHub
ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust
☆17Dec 14, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
allenai / gpv2-web10k
View on GitHub
Download Web-10K data by querying Bing Image Search
☆10Feb 1, 2022Updated 4 years ago
PeterLeeXX / DSCformer
View on GitHub
DSCformer: Lightweight model for predicting soil nitrogen content using VNIR-SWIR spectroscopy
☆66Dec 6, 2024Updated last year
casetext / r-and-r
View on GitHub
Code for the "Long Context Needs Some R&R" paper.
☆12Mar 11, 2024Updated 2 years ago
qagentur / texttunnel
View on GitHub
Python package for extractive NLP using the OpenAI API
☆17Aug 28, 2024Updated last year
microsoft / SWE-bench-Live
View on GitHub
[NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!
☆208Jun 11, 2026Updated last month
Brandt-J / SpectraReconstruction
View on GitHub
☆15Mar 6, 2022Updated 4 years ago
srush / transformers-bet
View on GitHub
☆12Mar 3, 2022Updated 4 years ago
crushr / EANN_Implemetation
View on GitHub
EANN(Pytorch)
☆10Mar 12, 2022Updated 4 years ago
simonw / webvid-datasette
View on GitHub
A Datasette instance for searching WebVid-10M
☆15Sep 30, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
0x404 / conventional-commit-classification
View on GitHub
A First Look at Conventional Commits Classification
☆15Nov 18, 2024Updated last year
Mogball / triton_lite
View on GitHub
☆20May 24, 2025Updated last year
ydzhang-stormstout / LGCN
View on GitHub
Source code for WWW 2021 paper "Lorentzian Graph Convolutional Networks"
☆14Jun 11, 2021Updated 5 years ago
SWE-EVO / SWE-EVO
View on GitHub
☆50May 3, 2026Updated 2 months ago
Qwen-Applications / STAR
View on GitHub
STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models
☆49Apr 23, 2026Updated 2 months ago
CharlesYu2000 / PCGU-UnlearningBias
View on GitHub
☆17Nov 7, 2023Updated 2 years ago
osome-iu / ChatGPT_domain_rating
View on GitHub
Code and data for paper "Large language models can rate news outlet credibility"
☆13Aug 10, 2024Updated last year
camel-ai / seta-env
View on GitHub
💻 SETA: Scaling Environments for Terminal Agents - Environments
☆141Feb 16, 2026Updated 4 months ago
Lucky-Wang-Chenlong / CodeSync
View on GitHub
[ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale
☆25Jul 31, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
THUDM / ChatGLM-Math
View on GitHub
☆82Apr 18, 2024Updated 2 years ago
Zhaoyang-Chu / HGRL-DTA
View on GitHub
This repository contains a PyTorch implementation of the paper "Hierarchical Graph Representation Learning for the Prediction of Drug-Tar…
☆12Jul 21, 2022Updated 3 years ago
epoch-research / training-cost-trends
View on GitHub
☆27Apr 1, 2026Updated 3 months ago
shiivangii / Leveraging-Intra-and-Inter-Modality-Relationship-for-Multimodal-Fake-News-Detection
View on GitHub
☆10Apr 24, 2022Updated 4 years ago
chrisjtan / gnn_cff
View on GitHub
☆19Apr 30, 2022Updated 4 years ago
AI-Hypercomputer / inference-benchmark
View on GitHub
☆22Mar 11, 2026Updated 4 months ago
Zhaoyang-Chu / code-unlearning
View on GitHub
This repository contains a PyTorch implementation of the ICSE'26 paper "Scrub It Out! Erasing Sensitive Memorization in Code Language Mod…
☆30Sep 18, 2025Updated 9 months ago
HetTransformer / HetTransformer-model
View on GitHub
☆10Jun 21, 2021Updated 5 years ago
pgasawa / continual-learning-bench
View on GitHub
Continual Learning Bench
☆182Jun 8, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Zhitao-He / AgentsCourt
View on GitHub
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)
☆18Dec 30, 2024Updated last year
jma712 / GraphCFE
View on GitHub
☆19May 21, 2025Updated last year
timbrgr / complex-scheduling-optimization-case-studies
View on GitHub
Optimization Case Studies: Generic Time Scheduling Problem (GTSP), Resource-Constrained Project Scheduling Problem (RCPSP) with Pulse Var…
☆11Nov 7, 2018Updated 7 years ago
rlite-project / RLite
View on GitHub
A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…
☆106Aug 25, 2025Updated 10 months ago
SprocketLab / slop-code-bench
View on GitHub
SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement
☆87Jun 11, 2026Updated last month
zorazrw / multilingual-conala
View on GitHub
[EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
☆23Feb 13, 2023Updated 3 years ago
spencerwooo / zan-chat
View on GitHub
A peer-to-peer communication system. BIT 小学期软件开发实训。
☆11Sep 7, 2018Updated 7 years ago