open-compass/CIBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/open-compass/CIBench)

open-compass / CIBench

Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "

☆15

Alternatives and similar repositories for CIBench

Users that are interested in CIBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RUCAIBox / FIGA
View on GitHub
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10May 5, 2024Updated 2 years ago
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
Tenggou / CCKS2020_task1_KBQA
View on GitHub
☆12Sep 11, 2020Updated 5 years ago
MarcTLaw / LorentzianDistanceRetrieval
View on GitHub
Lorentzian Distance Learning for Hyperbolic Representations: Retrieval experiments
☆13May 28, 2019Updated 7 years ago
open-mmlab / ecosystem
View on GitHub
☆36Sep 4, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
whats2000 / AgentLaboratory
View on GitHub
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea…
☆12Mar 29, 2026Updated 3 months ago
shachardon / naturally_occurring_feedback
View on GitHub
☆14Dec 1, 2025Updated 7 months ago
KodCode-AI / code-r1
View on GitHub
Reproducing R1 for Code with Reliable Rewards
☆13Apr 9, 2025Updated last year
miziha-zp / BiuG-XMRec-WSDMCup22
View on GitHub
☆18Feb 22, 2022Updated 4 years ago
whats2000 / fabric-mod-chinese-traslation-resouce-pack
View on GitHub
Making translate for fabric mod pack
☆12Mar 27, 2023Updated 3 years ago
sczhou / ProPainter_website
View on GitHub
Source code to the ProPainter website
☆14Jan 15, 2024Updated 2 years ago
aranciokov / FSMMDA_VideoRetrieval
View on GitHub
☆10Nov 23, 2023Updated 2 years ago
YihongDong / CDD-TED4LLMs
View on GitHub
☆16Nov 26, 2024Updated last year
Richar-Du / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20May 27, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
LeiLiLab / HardTestGen
View on GitHub
☆17Jan 27, 2026Updated 5 months ago
sqybi / DLXcn
View on GitHub
The Chinese version of the paper "Dancing Links" by Knuth
☆14Mar 28, 2013Updated 13 years ago
zjulgc / llmpeft4apr
View on GitHub
☆16Nov 9, 2024Updated last year
multi-swe-bench / MagentLess
View on GitHub
☆13Jul 31, 2025Updated 11 months ago
Snow-Dancing / ReinforcementLearning
View on GitHub
利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题
☆15Jul 25, 2019Updated 6 years ago
Tianwei-She / awesome-natural-language-generation
View on GitHub
A curated list of Natural Language Generation papers, tutorials, and blogs.
☆12Dec 13, 2018Updated 7 years ago
CodeLLM-Research / CodeJudge-Eval
View on GitHub
[COLING25] CodeJudge Eval: Can Large Language Models be Good Judges in Code Understanding?
☆12Dec 3, 2024Updated last year
fdalvi / analyzing-redundancy-in-pretrained-transformer-models
View on GitHub
Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020
☆14Oct 6, 2020Updated 5 years ago
ydzhang-stormstout / LGCN
View on GitHub
Source code for WWW 2021 paper "Lorentzian Graph Convolutional Networks"
☆14Jun 11, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AliBigdeli / Ultimate-Django4.2-Template
View on GitHub
Ultimate Django4.2 Template for starting any project from not zero!
☆15Jul 14, 2023Updated 3 years ago
MiuLab / GenDef
View on GitHub
Probing task; contextual embeddings -> textual definitions (EMNLP19)
☆12Apr 22, 2021Updated 5 years ago
the-crypt-keeper / llm-webapps
View on GitHub
jQuery, React and Streamlit applications written by LLMs
☆15Dec 24, 2023Updated 2 years ago
yhif / Provinces_citys_areas
View on GitHub
省市区数据库、包含别名、拼音、拼音缩写、邮编
☆14May 25, 2016Updated 10 years ago
microsoft / LEMA
View on GitHub
official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"
☆60Dec 20, 2023Updated 2 years ago
dhgottesman / keen_estimating_knowledge_in_llms
View on GitHub
☆18Nov 5, 2025Updated 8 months ago
RossiXu / event-centric-opinion-mining
View on GitHub
☆12Nov 20, 2023Updated 2 years ago
open-compass / GPassK
View on GitHub
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆33Aug 5, 2025Updated 11 months ago
shengyuzhang / VideoTitling
View on GitHub
Comprehensive Information Integration Modeling Framework for Video Titling
☆11Aug 27, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
datacluster-labs / Domestic-Fire-and-Smoke-Dataset
View on GitHub
This dataset consists of domestic Fire and Smoke images.
☆16Oct 22, 2021Updated 4 years ago
MLSysOps / Code-Agent-Survey
View on GitHub
A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.
☆22Aug 20, 2024Updated last year
CyberZHG / keras-trans-mask
View on GitHub
Remove and restore masks for layers that do not support masking
☆16Jan 22, 2022Updated 4 years ago
gaia-agent / gaia-agent
View on GitHub
GAIA-benchmark-ready super agent built on AI SDK v6 ToolLoopAgent
☆16Apr 7, 2026Updated 3 months ago
utahnlp / BERT-fine-tuning-analysis
View on GitHub
The codebase for the paper: A Closer Look at How Fine-tuning Changes BERT
☆24Apr 3, 2023Updated 3 years ago
RPMTW / ResourcePack-Mod-zh_tw
View on GitHub
一個相容Forge/Fabric的Minecraft繁體中文化模組資源包。採用不同於以往的方式，既便利又簡單。
☆17Oct 1, 2023Updated 2 years ago
Qwen-Applications / DIR
View on GitHub
☆17Feb 14, 2026Updated 5 months ago