β20Oct 10, 2025Updated 5 months ago
Alternatives and similar repositories for code-judge
Users that are interested in code-judge are comparing it to the libraries listed below
Sorting:
- All-in-one benchmarking platform for evaluating LLM.β15Nov 12, 2025Updated 4 months ago
- π» Terminal-Agent with Human-in-the-Loop Learningβ39Jan 16, 2026Updated 2 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.β18Apr 22, 2025Updated 11 months ago
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scaleβ25Jul 31, 2025Updated 7 months ago
- β51Mar 9, 2026Updated last week
- β42Mar 26, 2025Updated 11 months ago
- CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2β10Nov 21, 2022Updated 3 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedbackβ12Jul 13, 2022Updated 3 years ago
- β42Jan 11, 2021Updated 5 years ago
- Provides a minimal implementation to extract FLAN datasets for further processingβ11Feb 1, 2023Updated 3 years ago
- FlexAttention w/ FlashAttention3 Supportβ27Oct 5, 2024Updated last year
- β12Jan 15, 2015Updated 11 years ago
- Labs of 2019 Web Information Processing and Application in USTC.β11Jan 15, 2020Updated 6 years ago
- β76Sep 15, 2025Updated 6 months ago
- This code implements a basic, Twitter-aware tokenizer.β12Feb 8, 2024Updated 2 years ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM traβ¦β54Oct 11, 2025Updated 5 months ago
- This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models.β11Aug 21, 2023Updated 2 years ago
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)β40Jul 22, 2022Updated 3 years ago
- β12Dec 28, 2016Updated 9 years ago
- Dynamic Spear Modelβ12Jul 24, 2019Updated 6 years ago
- β14May 26, 2021Updated 4 years ago
- β16Feb 6, 2024Updated 2 years ago
- β¨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framworkβ313Sep 6, 2025Updated 6 months ago
- The code for the Mimic and Rephrase paperβ13Mar 19, 2023Updated 3 years ago
- Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)β13Nov 11, 2025Updated 4 months ago
- Muon fsdp 2β55Aug 8, 2025Updated 7 months ago
- Microsoft Complex Tasks Datasetβ17Jun 12, 2023Updated 2 years ago
- β15Feb 24, 2021Updated 5 years ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".β20Oct 29, 2025Updated 4 months ago
- Giza++β12May 12, 2015Updated 10 years ago
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratingsβ67Feb 3, 2025Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".β55Nov 29, 2024Updated last year
- This is a demo how to write a high performance convolution run on apple siliconβ57Feb 8, 2022Updated 4 years ago
- β22Dec 25, 2025Updated 2 months ago
- The official implementation of dLLM-Varβ31Nov 6, 2025Updated 4 months ago
- Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"β18May 18, 2022Updated 3 years ago
- Learning to Model Editing Processesβ26Aug 3, 2025Updated 7 months ago
- β35Updated this week
- Codes for "NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer" (ACL 2021 findings)β15Nov 3, 2021Updated 4 years ago