CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
☆72Feb 3, 2025Updated last year
Alternatives and similar repositories for CodeElo
Users that are interested in CodeElo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 11 months ago
- ☆12Feb 11, 2026Updated 2 months ago
- ☆16Feb 6, 2024Updated 2 years ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆835Jul 16, 2025Updated 8 months ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆14Nov 11, 2025Updated 5 months ago
- LLM play 20questions with itself☆13Mar 31, 2023Updated 3 years ago
- ☆22Oct 10, 2025Updated 6 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆36Feb 9, 2026Updated 2 months ago
- ☆12Aug 8, 2023Updated 2 years ago
- James' cookbook of evaluations and finetuning experiments☆26Feb 19, 2026Updated last month
- ☆71Oct 23, 2025Updated 5 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated last year
- ☆23Dec 17, 2025Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21Feb 10, 2025Updated last year
- An archive of learning resources assembled by current Exun members and alumni.☆15Oct 6, 2022Updated 3 years ago
- ☆11Aug 10, 2021Updated 4 years ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 6 months ago
- A lambda calculus parser, evaluator and repl☆11Oct 30, 2021Updated 4 years ago
- Reproducing R1 for Code with Reliable Rewards☆302May 5, 2025Updated 11 months ago
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…☆19Sep 17, 2025Updated 6 months ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆18Jul 21, 2023Updated 2 years ago
- A scrapy crawler that crawls problems and its best solutions on codeforces.com☆13Feb 25, 2016Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆236Feb 28, 2026Updated last month
- Data mapping framework for rust stuff☆51Mar 25, 2026Updated 2 weeks ago
- Code for CVPR paper: Computationally Budgeted Continual Learning: What Does Matter?☆17Mar 16, 2024Updated 2 years ago
- An esoteric programming language with just two data types: null and tape☆11Jan 31, 2024Updated 2 years ago
- ☆60Apr 2, 2025Updated last year
- ☆42Mar 26, 2025Updated last year
- my personal mcp server☆13Apr 23, 2025Updated 11 months ago
- ☆24Sep 24, 2024Updated last year
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆28Jul 23, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆1,126Jan 10, 2026Updated 3 months ago
- RIFE with IFUNet, FusionNet and RefineNet☆12Jun 30, 2022Updated 3 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆15Jan 12, 2026Updated 2 months ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- ☆14Jan 21, 2025Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆80May 2, 2025Updated 11 months ago