bigcode-project / the-stack-v2
Code for the curation of The Stack v2 and StarCoder2 training data
☆90Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for the-stack-v2
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆122Updated 3 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆115Updated last month
- A multi-programming language benchmark for LLMs☆208Updated this week
- Repository for analysis and experiments in the BigCode project.☆115Updated 8 months ago
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆74Updated 2 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆134Updated 3 months ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".☆222Updated 3 weeks ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆141Updated 6 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆127Updated 2 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 3 weeks ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆120Updated last month
- ☆366Updated 3 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆222Updated last month
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆104Updated 5 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆285Updated last month
- ☆119Updated 6 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆206Updated 10 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆199Updated 6 months ago
- ☆103Updated last year
- RepoQA: Evaluating Long-Context Code Understanding☆100Updated 3 weeks ago
- Open Source WizardCoder Dataset☆153Updated last year
- ☆101Updated 4 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆91Updated 4 months ago
- A framework for few-shot evaluation of autoregressive language models.☆23Updated 11 months ago
- NaturalCodeBench (Findings of ACL 2024)☆56Updated last month
- A project to improve skills of large language models☆194Updated this week
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆79Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆193Updated last week
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆270Updated 3 weeks ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆52Updated last month