FlagOpen/TACO

☆239

Alternatives and similar repositories for TACO

Users who are interested in TACO compare it to the repositories listed below.

huangd1999 / EffiCoder
[ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning
☆16 · May 24, 2025 · Updated last year
LiveCodeBench / LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
☆911 · Jul 16, 2025 · Updated last year
KodCode-AI / code-r1
Reproducing R1 for Code with Reliable Rewards
☆13 · Apr 9, 2025 · Updated last year
FlagOpen / Infinity-Instruct
☆51 · Jun 14, 2024 · Updated 2 years ago
jiangxxxue / ROCODE
This repository hosts the source code for the paper "ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Mo…
☆16 · Dec 16, 2025 · Updated 7 months ago
nuprl / MultiPL-E
A multi-programming language benchmark for LLMs
☆312 · Apr 12, 2026 · Updated 3 months ago
bigcode-project / octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
☆479 · Feb 5, 2025 · Updated last year
ganler / code-r1
Reproducing R1 for Code with Reliable Rewards
☆313 · May 5, 2025 · Updated last year
Adaxry / Post-Instruction
☆21 · Sep 5, 2023 · Updated 2 years ago
evalplus / evalplus
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
☆1,782 · Oct 2, 2025 · Updated 9 months ago
hendrycks / apps
APPS: Automated Programming Progress Standard (NeurIPS 2021)
☆534 · Jun 19, 2024 · Updated 2 years ago
YihongDong / CDD-TED4LLMs
☆16 · Nov 26, 2024 · Updated last year
R2E-Gym / R2E-Gym
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
☆307 · Jul 13, 2025 · Updated last year
bigcode-project / bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
☆1,052 · Jul 22, 2025 · Updated 11 months ago
openai / human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
☆3,312 · Jan 17, 2025 · Updated last year
wanghanbinpanda / Large-Language-Models-for-Code
Large Language Models (LLMs) of Code
☆20 · Apr 23, 2023 · Updated 3 years ago
InternLM / SWE-Fixer
☆139 · May 8, 2025 · Updated last year
ysh-1998 / CoWPiRec
The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.
☆25 · Jan 30, 2024 · Updated 2 years ago
ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37 · Mar 20, 2023 · Updated 3 years ago
amazon-science / cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆181 · Aug 15, 2025 · Updated 11 months ago
seketeam / EvoCodeBench
An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories
☆71 · Aug 15, 2024 · Updated last year
microsoft / FEA-Bench
[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation
☆57 · Jan 28, 2026 · Updated 5 months ago
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆63 · Apr 10, 2024 · Updated 2 years ago
pkuzqh / GrammarT5
☆11 · May 18, 2024 · Updated 2 years ago
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆91 · Jul 21, 2024 · Updated 2 years ago
GAIR-NLP / ReasonEval
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80 · Oct 9, 2025 · Updated 9 months ago
JohannesTheo / trapped-in-texture-bias
Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…
☆16 · Jan 16, 2024 · Updated 2 years ago
CriticBench / CriticBench
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31 · Mar 5, 2024 · Updated 2 years ago
allenai / hyperdecoders
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆14 · Oct 11, 2022 · Updated 3 years ago
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146 · Oct 27, 2024 · Updated last year
THUDM / NaturalCodeBench
NaturalCodeBench (Findings of ACL 2024)
☆70 · Oct 14, 2024 · Updated last year
MasterVito / SwS
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆41 · Nov 11, 2025 · Updated 8 months ago
bigcode-project / bigcodebench
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
☆515 · Jan 3, 2026 · Updated 6 months ago
Leolty / repobench
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
☆212 · Aug 16, 2024 · Updated last year
GAIR-NLP / MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
☆418 · Apr 4, 2025 · Updated last year
PootieT / explain-then-translate
Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…
☆29 · Dec 5, 2023 · Updated 2 years ago
bigcode-project / selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆323 · Feb 24, 2025 · Updated last year
QwenLM / CodeElo
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
☆74 · Feb 3, 2025 · Updated last year
Naman-ntc / FastCode
Utilities for efficient fine-tuning, inference and evaluation of code generation models
☆20 · Oct 3, 2023 · Updated 2 years ago