loubnabnl / santacoder-finetuningLinks
Fine-tune SantaCoder for Code/Text Generation.
☆195Updated 2 years ago
Alternatives and similar repositories for santacoder-finetuning
Users that are interested in santacoder-finetuning are comparing it to the libraries listed below
Sorting:
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆472Updated 8 months ago
- CodeGen2 models for program synthesis☆271Updated 2 years ago
- ☆275Updated 2 years ago
- Repository for analysis and experiments in the BigCode project.☆124Updated last year
- Run evaluation on LLMs using human-eval benchmark☆420Updated 2 years ago
- Ongoing research training transformer models at scale☆391Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆316Updated 7 months ago
- Open-source Self-Instruction Tuning Code LLM☆170Updated 2 years ago
- ☆84Updated 2 years ago
- Open Source WizardCoder Dataset☆161Updated 2 years ago
- Generative model for code infilling and synthesis☆308Updated 2 years ago
- ☆667Updated 11 months ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆131Updated last year
- The data processing pipeline for the Koala chatbot language model☆118Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆555Updated last year
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆112Updated 11 months ago
- Official repository for LongChat and LongEval☆533Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆249Updated last year
- Minimal library to train LLMs on TPU in JAX with pjit().☆301Updated last year
- PaL: Program-Aided Language Models (ICML 2023)☆511Updated 2 years ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆311Updated last year
- Code for the paper "Efficient Training of Language Models to Fill in the Middle"☆186Updated 2 years ago
- evol augment any dataset online☆60Updated 2 years ago
- ☆416Updated last year
- Merge Transformers language models by use of gradient parameters.☆208Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated 2 years ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆353Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆505Updated last year
- ☆472Updated last year