0xWJ / code-judgeLinks
☆9Updated last month
Alternatives and similar repositories for code-judge
Users that are interested in code-judge are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆49Updated 9 months ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient☆44Updated last week
- ☆40Updated 2 months ago
- Async pipelined version of Verl☆112Updated 4 months ago
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆74Updated last week
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆109Updated 4 months ago
- Estimate MFU for DeepSeekV3☆24Updated 7 months ago
- ☆114Updated 2 months ago
- Source code for the paper "LongGenBench: Long-context Generation Benchmark"☆22Updated 10 months ago
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…☆22Updated 11 months ago
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"☆41Updated 2 weeks ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆30Updated 5 months ago
- ☆29Updated this week
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆218Updated 5 months ago
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆148Updated 4 months ago
- ☆26Updated 6 months ago
- Best practices for testing advanced Mixtral, DeepSeek, and Qwen series MoE models using Megatron Core MoE.☆45Updated last week
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆48Updated 9 months ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆61Updated 3 weeks ago
- Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"☆25Updated 2 years ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆162Updated last week
- All-in-one benchmarking platform for evaluating LLM.☆15Updated 3 weeks ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [arXiv '25]☆45Updated last month
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆200Updated 5 months ago
- ☆20Updated 4 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆74Updated this week
- Repository of LV-Eval Benchmark☆68Updated 11 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 8 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆26Updated last month
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆17Updated 3 months ago