bigcode-project / bigcode-dataset
☆462 · Updated 10 months ago
Alternatives and similar repositories for bigcode-dataset
Users who are interested in bigcode-dataset are comparing it to the libraries listed below.
- 🐙 OctoPack: Instruction Tuning Code Large Language Models ☆468 · Updated 4 months ago
- A framework for the evaluation of autoregressive code generation language models. ☆949 · Updated 7 months ago
- Run evaluation on LLMs using human-eval benchmark ☆414 · Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆245 · Updated 7 months ago
- Open Source WizardCoder Dataset ☆158 · Updated last year
- A multi-programming language benchmark for LLMs ☆253 · Updated this week
- ☆657 · Updated 7 months ago
- ☆270 · Updated 2 years ago
- ☆756 · Updated last year
- Repository for analysis and experiments in the BigCode project. ☆119 · Updated last year
- Fine-tune SantaCoder for Code/Text Generation. ☆192 · Updated 2 years ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus ☆313 · Updated 2 years ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆167 · Updated 10 months ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆546 · Updated last year
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI. ☆143 · Updated 6 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆143 · Updated 10 months ago
- Measuring Massive Multitask Language Understanding | ICLR 2021 ☆1,434 · Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data ☆108 · Updated last year
- PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets ☆332 · Updated last year
- ☆520 · Updated 7 months ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆534 · Updated 5 months ago
- Official repository for LongChat and LongEval ☆521 · Updated last year
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them ☆497 · Updated last year
- distributed trainer for LLMs ☆577 · Updated last year
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" ☆554 · Updated this week
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI ☆383 · Updated 2 months ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,499 · Updated last year
- Expanding natural instructions ☆1,006 · Updated last year
- ☆361 · Updated 2 years ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] ☆556 · Updated 6 months ago