loubnabnl / santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
⭐ 186 · Updated last year
Related projects
Alternatives and complementary repositories for santacoder-finetuning
- Run evaluation on LLMs using human-eval benchmark ⭐ 380 · Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models ⭐ 435 · Updated 2 months ago
- ⭐ 263 · Updated last year
- Open Source WizardCoder Dataset ⭐ 153 · Updated last year
- ⭐ 344 · Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023 ⭐ 232 · Updated 11 months ago
- Open-source Self-Instruction Tuning Code LLM ⭐ 168 · Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ⭐ 206 · Updated 10 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ⭐ 133 · Updated 3 months ago
- ⭐ 72 · Updated last year
- CodeGen2 models for program synthesis ⭐ 274 · Updated last year
- Official repository for LongChat and LongEval ⭐ 512 · Updated 6 months ago
- ⭐ 86 · Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ⭐ 145 · Updated last year
- evol augment any dataset online ⭐ 55 · Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ⭐ 183 · Updated 11 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ⭐ 194 · Updated 6 months ago
- The data processing pipeline for the Koala chatbot language model ⭐ 117 · Updated last year
- ⭐ 366 · Updated 3 months ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ⭐ 270 · Updated 3 weeks ago
- This project is an attempt to create a common metric to test LLMs for progress in eliminating hallucinations which is the most serious c… ⭐ 221 · Updated last year
- Accepted by Transactions on Machine Learning Research (TMLR) ⭐ 120 · Updated last month
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ⭐ 220 · Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ⭐ 293 · Updated 11 months ago
- A bagel, with everything. ⭐ 312 · Updated 7 months ago
- batched loras ⭐ 336 · Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ⭐ 122 · Updated 3 months ago
- A multi-programming language benchmark for LLMs ⭐ 208 · Updated this week
- ⭐ 175 · Updated last year
- Repository for analysis and experiments in the BigCode project. ⭐ 115 · Updated 8 months ago