Fine-tune SantaCoder for Code/Text Generation.
☆195Apr 11, 2023Updated 3 years ago
Alternatives and similar repositories for santacoder-finetuning
Users who are interested in santacoder-finetuning are comparing it to the libraries listed below.
- A framework for the evaluation of autoregressive code generation language models.☆1,035Jul 22, 2025Updated 8 months ago
- ☆23Jul 10, 2023Updated 2 years ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆478Feb 5, 2025Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆135Oct 5, 2024Updated last year
- LLM Workshop by Sourab Mangrulkar☆402Jun 16, 2024Updated last year
- A multi-programming language benchmark for LLMs☆301Apr 12, 2026Updated last week
- Home of StarCoder: fine-tuning & inference!☆7,518Feb 27, 2024Updated 2 years ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆45May 31, 2023Updated 2 years ago
- ☆127Apr 22, 2023Updated 2 years ago
- Repository for opt-out requests.☆10Mar 25, 2024Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago
- Generative model for code infilling and synthesis☆313Sep 9, 2023Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark☆430Sep 12, 2023Updated 2 years ago
- Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework"☆65Apr 18, 2022Updated 4 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆63Apr 10, 2024Updated 2 years ago
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,169Oct 27, 2025Updated 5 months ago
- Ongoing research training transformer models at scale☆394Aug 20, 2024Updated last year
- Open Source WizardCoder Dataset☆166Jul 12, 2023Updated 2 years ago
- ☆491Aug 15, 2024Updated last year
- C++ implementation for 💫StarCoder☆458Sep 9, 2023Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆307Mar 1, 2023Updated 3 years ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,713Oct 2, 2025Updated 6 months ago
- A pre-trained GPT model for Python code completion and generation☆282Jun 12, 2023Updated 2 years ago
- Repository for analysis and experiments in the BigCode project.☆127Mar 20, 2024Updated 2 years ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆56Mar 21, 2024Updated 2 years ago
- ☆23Jan 25, 2023Updated 3 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆130Apr 11, 2024Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆284Jul 11, 2024Updated last year
- ☆567Nov 20, 2024Updated last year
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…☆66Dec 3, 2021Updated 4 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆566Jan 21, 2025Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆323Feb 24, 2025Updated last year
- Generate the WizardCoder Instruct from the CodeAlpaca☆21Jun 27, 2023Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆179Jul 31, 2023Updated 2 years ago
- [ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shi…☆32Nov 15, 2021Updated 4 years ago
- The implementation of the IJCAI 2018 paper: Code Completion with Neural Attention and Pointer Networks☆18Sep 11, 2019Updated 6 years ago