β23Oct 10, 2025Updated 6 months ago
Alternatives and similar repositories for code-judge
Users that are interested in code-judge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- All-in-one benchmarking platform for evaluating LLM.β15Nov 12, 2025Updated 5 months ago
- π» Terminal-Agent with Human-in-the-Loop Learningβ39Jan 16, 2026Updated 3 months ago
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scaleβ25Jul 31, 2025Updated 9 months ago
- β52Mar 9, 2026Updated last month
- β32Jan 16, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2β10Nov 21, 2022Updated 3 years ago
- possibly useful materials for learning RWKV language model.β26Jun 8, 2023Updated 2 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedbackβ12Jul 13, 2022Updated 3 years ago
- β41Jan 11, 2021Updated 5 years ago
- Provides a minimal implementation to extract FLAN datasets for further processingβ11Feb 1, 2023Updated 3 years ago
- FlexAttention w/ FlashAttention3 Supportβ27Oct 5, 2024Updated last year
- β12Jan 15, 2015Updated 11 years ago
- Labs of 2019 Web Information Processing and Application in USTC.β11Jan 15, 2020Updated 6 years ago
- This code implements a basic, Twitter-aware tokenizer.β12Feb 8, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM traβ¦β57Oct 11, 2025Updated 6 months ago
- β81Sep 15, 2025Updated 7 months ago
- 2022 USTC 011705 (OSH) Course Project of Runikraft Groupβ13Jul 22, 2022Updated 3 years ago
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)β41Jul 22, 2022Updated 3 years ago
- β12Dec 28, 2016Updated 9 years ago
- Dynamic Spear Modelβ12Jul 24, 2019Updated 6 years ago
- β14May 26, 2021Updated 4 years ago
- The code for the Mimic and Rephrase paperβ13Mar 19, 2023Updated 3 years ago
- β16Feb 6, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- β¨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framworkβ318Sep 6, 2025Updated 7 months ago
- Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)β14Nov 11, 2025Updated 5 months ago
- Muon fsdp 2β56Aug 8, 2025Updated 8 months ago
- Microsoft Complex Tasks Datasetβ17Jun 12, 2023Updated 2 years ago
- β15Feb 24, 2021Updated 5 years ago
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratingsβ74Feb 3, 2025Updated last year
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansionβ13Jul 26, 2023Updated 2 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.β20Feb 12, 2023Updated 3 years ago
- This is a demo how to write a high performance convolution run on apple siliconβ56Feb 8, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official implementation of dLLM-Varβ32Nov 6, 2025Updated 5 months ago
- β12Jun 8, 2021Updated 4 years ago
- Reproducing R1 for Code with Reliable Rewardsβ310May 5, 2025Updated 11 months ago
- nnScaler: Compiling DNN models for Parallel Trainingβ129Apr 8, 2026Updated 3 weeks ago
- Embedding-based evaluation metrics for dialogue generation.β15Jan 8, 2023Updated 3 years ago
- Learning to Model Editing Processesβ26Aug 3, 2025Updated 8 months ago
- β43Mar 23, 2026Updated last month