FSoft-AI4Code / CodeCapybara
Open-source Self-Instruction Tuning Code LLM
☆170Updated last year
Alternatives and similar repositories for CodeCapybara:
Users who are interested in CodeCapybara are comparing it to the repositories listed below
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆92Updated 8 months ago
- ☆84Updated last year
- ☆73Updated last year
- Pre-training code for CrystalCoder 7B LLM☆54Updated 11 months ago
- evol augment any dataset online☆59Updated last year
- ☆269Updated last year
- ☆94Updated last year
- ☆75Updated last month
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆216Updated 11 months ago
- A hard gym for programming☆152Updated 9 months ago
- ☆172Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Simple next-token-prediction for RLHF☆225Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆62Updated 7 months ago
- Spherical Merge PyTorch/HF format Language Models with minimal feature loss.☆120Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆304Updated 2 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆462Updated 2 months ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Updated 10 months ago
- This project is an attempt to create a common metric to test LLMs for progress in eliminating hallucinations, which is the most serious c…☆222Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes…☆147Updated last year
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23)☆86Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆243Updated last year
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆136Updated 6 months ago
- Open Source WizardCoder Dataset☆157Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆178Updated 2 years ago
- Fine-tune SantaCoder for Code/Text Generation.☆191Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆138Updated 2 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated last year