bigcode-project / bigcodebench-annotation
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
☆19Updated 7 months ago
Alternatives and similar repositories for bigcodebench-annotation:
Users that are interested in bigcodebench-annotation are comparing it to the libraries listed below
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 11 months ago
- ☆22Updated 4 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆58Updated 5 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 4 months ago
- ☆59Updated 6 months ago
- official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"☆59Updated last year
- GenRM-CoT: Data release for verification rationales☆51Updated 5 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆44Updated last month
- ☆52Updated last year
- ☆26Updated 2 months ago
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆25Updated 6 months ago
- ☆115Updated 8 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆73Updated 9 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆29Updated 9 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆84Updated last month
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆48Updated 10 months ago
- [ICML 2024] Self-Infilling Code Generation☆18Updated 10 months ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆48Updated last year
- ☆94Updated last year
- ☆12Updated 4 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆20Updated last month
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆56Updated 5 months ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆121Updated 8 months ago
- ☆11Updated 9 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆130Updated 6 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆55Updated 8 months ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆109Updated last year
- ☆36Updated 9 months ago