APEXLAB / CodeApex
☆49Updated last year
Alternatives and similar repositories for CodeApex
Users that are interested in CodeApex are comparing it to the libraries listed below
Sorting:
- ☆81Updated last year
- Reproducing R1 for Code with Reliable Rewards☆190Updated 2 weeks ago
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆35Updated last year
- AI Alignment: A Comprehensive Survey☆133Updated last year
- ☆33Updated 5 months ago
- NaturalCodeBench (Findings of ACL 2024)☆64Updated 7 months ago
- ☆31Updated this week
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆66Updated this week
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 7 months ago
- ☆39Updated 5 months ago
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆81Updated 8 months ago
- ☆143Updated 10 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆78Updated this week
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆38Updated 9 months ago
- Token level visualization tools for large language models☆80Updated 4 months ago
- ☆63Updated 5 months ago
- A Comprehensive Benchmark for Software Development.☆105Updated 11 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 11 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆49Updated last year
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆57Updated this week
- ☆20Updated 3 weeks ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆127Updated 11 months ago
- A Bilingual Role Evaluation Benchmark for Large Language Models☆40Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
- A Comprehensive Survey on Long Context Language Modeling☆142Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆73Updated 3 weeks ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆64Updated 8 months ago
- A research repo for experiments about Reinforcement Finetuning☆46Updated last month