loubnabnl / santacoder-finetuningLinks
Fine-tune SantaCoder for Code/Text Generation.
β194Updated 2 years ago
Alternatives and similar repositories for santacoder-finetuning
Users that are interested in santacoder-finetuning are comparing it to the libraries listed below
Sorting:
- π OctoPack: Instruction Tuning Code Large Language Modelsβ479Updated 10 months ago
- Run evaluation on LLMs using human-eval benchmarkβ426Updated 2 years ago
- β277Updated 2 years ago
- CodeGen2 models for program synthesisβ271Updated 2 years ago
- Open-source Self-Instruction Tuning Code LLMβ171Updated 2 years ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generationβ323Updated 10 months ago
- Accepted by Transactions on Machine Learning Research (TMLR)β136Updated last year
- Repository for analysis and experiments in the BigCode project.β128Updated last year
- β84Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023β251Updated 2 years ago
- Ongoing research training transformer models at scaleβ394Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMAβ302Updated 2 years ago
- Open Source WizardCoder Datasetβ162Updated 2 years ago
- Official repository for LongChat and LongEvalβ533Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRavβ¦β318Updated 2 years ago
- batched lorasβ347Updated 2 years ago
- β379Updated 2 years ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"β316Updated 2 years ago
- CodeSage: Code Representation Learning At Scale (ICLR 2024)β114Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.β189Updated 2 years ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generationβ103Updated last year
- A joint community effort to create one central leaderboard for LLMs.β308Updated last year
- A bagel, with everything.β326Updated last year
- β672Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generationβ220Updated 2 years ago
- The data processing pipeline for the Koala chatbot language modelβ118Updated 2 years ago
- CodeBERTScore: an automatic metric for code generation, based on BERTScoreβ206Updated last year
- Generate textbook-quality synthetic LLM pretraining dataβ508Updated 2 years ago
- β173Updated 2 years ago
- β¨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024β182Updated last year