0xWJ/code-judge

☆24

Alternatives and similar repositories for code-judge

Users who are interested in code-judge are comparing it to the libraries listed below.

ysy-phoenix / evalhub
All-in-one benchmarking platform for evaluating LLM.
☆15 · Nov 12, 2025 · Updated 8 months ago
phonism / CP-Zero
Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.
☆18 · Apr 22, 2025 · Updated last year
richardodliu / OpenCodeEval
☆52 · Mar 9, 2026 · Updated 4 months ago
instance-wise-ordered-transformer / IOT
☆20 · Feb 26, 2021 · Updated 5 years ago
zhehangdu / Newton-Muon
The Newton-Muon optimizer
☆30 · Jun 5, 2026 · Updated last month
microsoft / rStar
☆1,422 · Sep 12, 2025 · Updated 10 months ago
ChengpengLi1003 / CoRT
☆72 · Oct 23, 2025 · Updated 8 months ago
Labman42 / JetEngine
A lightweight Inference Engine built for block diffusion models
☆47 · Apr 12, 2026 · Updated 3 months ago
Lucky-Wang-Chenlong / CodeSync
[ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale
☆24 · Jul 31, 2025 · Updated 11 months ago
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆135 · Jul 2, 2026 · Updated 2 weeks ago
THUDM / APAR
APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding
☆14 · Jul 22, 2024 · Updated last year
terminal-agent / reptile
💻 Terminal-Agent with Human-in-the-Loop Learning
☆40 · Jan 16, 2026 · Updated 6 months ago
ant-research / M2-Miner
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55 · Apr 22, 2026 · Updated 2 months ago
huggingface / ioi
☆42 · Mar 26, 2025 · Updated last year
zwhe99 / DeepMath
A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆294 · Sep 25, 2025 · Updated 9 months ago
McGill-NLP / feedbackqa
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12 · Jul 13, 2022 · Updated 4 years ago
pavanchhatpar / copynet-tf
CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2
☆10 · Nov 21, 2022 · Updated 3 years ago
Hannibal046 / RWKV-howto
possibly useful materials for learning RWKV language model.
☆27 · Jun 8, 2023 · Updated 3 years ago
jacobandreas / geca
☆41 · Jan 11, 2021 · Updated 5 years ago
usail-hkust / benchmark_inference_time_computation_LLM
[NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning
☆16 · Sep 20, 2025 · Updated 10 months ago
leloykun / adaptive-muon
A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…
☆19 · Jan 11, 2025 · Updated last year
yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆32 · Oct 9, 2025 · Updated 9 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆92 · Nov 18, 2025 · Updated 8 months ago
ChicagoHAI / decsum
Implementation for Decision-focused Summarization (EMNLP2021)
☆12 · Mar 14, 2022 · Updated 4 years ago
waynchi / editbench
☆31 · Apr 7, 2026 · Updated 3 months ago
icepear-jzx / USTC-Web-Info-2019
Labs of 2019 Web Information Processing and Application in USTC.
☆11 · Jan 15, 2020 · Updated 6 years ago
portal-cornell / muCode
☆33 · Oct 2, 2025 · Updated 9 months ago
yzhangchuck / awesome-llm-reasoning-long2short-papers
☆17 · Apr 11, 2025 · Updated last year
OSH-2022 / x-runikraft
2022 USTC 011705 (OSH) Course Project of Runikraft Group
☆13 · Jul 22, 2022 · Updated 3 years ago
Meinersbur / pet
Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)
☆42 · Jul 22, 2022 · Updated 3 years ago
dlatk / happierfuntokenizing
This code implements a basic, Twitter-aware tokenizer.
☆12 · Feb 8, 2024 · Updated 2 years ago
bigai-ai / QA-Synthesizer
Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)
☆14 · Nov 11, 2025 · Updated 8 months ago
liushulinle / UloRL
An Ultra-Long Output Reinforcement Learning Approach
☆23 · Jul 31, 2025 · Updated 11 months ago
hao-cheng / dynamic_speaker_model
Dynamic Speaker Model
☆12 · Jul 24, 2019 · Updated 6 years ago
microsoft / MSComplexTasks
Microsoft Complex Tasks Dataset
☆18 · Jun 12, 2023 · Updated 3 years ago
KodCode-AI / kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework
☆321 · Sep 6, 2025 · Updated 10 months ago
alexey-osipenko / giza-pp
Giza++
☆12 · May 12, 2015 · Updated 11 years ago
asappresearch / interactive-classification
☆15 · Feb 24, 2021 · Updated 5 years ago
WooooDyy / MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆55 · Nov 29, 2024 · Updated last year