bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆57 · Updated 6 months ago
Related projects
Alternatives and complementary repositories for astraios
- ☆39 · Updated 5 months ago
- Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation ☆76 · Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆72 · Updated 2 months ago
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI" ☆85 · Updated last month
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆52 · Updated 2 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 SRW ☆52 · Updated last month
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation ☆111 · Updated 3 weeks ago
- ☆18 · Updated last week
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆73 · Updated 9 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆84 · Updated 4 months ago
- CodeUltraFeedback: aligning large language models to coding preferences ☆65 · Updated 4 months ago
- ☆41 · Updated this week
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval ☆74 · Updated last month
- Lightweight tool to identify Data Contamination in LLMs evaluation ☆40 · Updated 8 months ago
- ☆75 · Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OpenAI ☆78 · Updated this week
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆44 · Updated 9 months ago
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ☆19 · Updated 3 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆22 · Updated last month
- Evol augment any dataset online ☆55 · Updated last year
- ☆33 · Updated 2 months ago
- ☆50 · Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆67 · Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆36 · Updated 8 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆67 · Updated 5 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" ☆91 · Updated 4 months ago
- ☆111 · Updated last month
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning ☆30 · Updated 3 months ago
- Training and Benchmarking LLMs for Code Preference. ☆19 · Updated last week
- ☆98 · Updated 5 months ago