APEXLAB / CodeApex
☆48Updated last year
Alternatives and similar repositories for CodeApex:
Users that are interested in CodeApex are comparing it to the libraries listed below
- NaturalCodeBench (Findings of ACL 2024)☆61Updated 3 months ago
- ☆62Updated 3 months ago
- ☆92Updated 9 months ago
- ☆81Updated 9 months ago
- A Comprehensive Benchmark for Software Development.☆88Updated 7 months ago
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆106Updated last week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 7 months ago
- Codev-Bench (Code Development Benchmark), a fine-grained, real-world, repository-level, and developer-centric evaluation framework. Codev…☆33Updated 2 months ago
- ☆41Updated 7 months ago
- ☆50Updated last month
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆76Updated 4 months ago
- ☆48Updated 10 months ago
- ☆28Updated 2 months ago
- ☆137Updated 6 months ago
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆58Updated last month
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆79Updated last year
- ☆87Updated last month
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆64Updated last month
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 10 months ago
- ☆125Updated last year
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆37Updated 5 months ago
- AI Alignment: A Comprehensive Survey☆133Updated last year
- ☆159Updated this week
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆31Updated 5 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆110Updated 2 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆62Updated 4 months ago
- ☆40Updated last month
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆72Updated last year
- ☆25Updated last month
- Feeling confused about super alignment? Here is a reading list☆42Updated last year