Home of StarCoder2!
☆2,047Mar 21, 2024Updated last year
Alternatives and similar repositories for starcoder2
Users that are interested in starcoder2 are comparing it to the libraries listed below
Sorting:
- Home of StarCoder: fine-tuning & inference!☆7,529Feb 27, 2024Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆130Apr 11, 2024Updated last year
- official repository of aiXcoder-7B Code Large Language Model☆2,273Jul 9, 2025Updated 8 months ago
- Inference code for CodeLlama models☆16,344Aug 12, 2024Updated last year
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,088Nov 1, 2024Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆322Feb 24, 2025Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,479Jun 7, 2025Updated 9 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,704May 7, 2024Updated last year
- Modeling, training, eval, and inference code for OLMo☆6,404Nov 24, 2025Updated 3 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,925Nov 25, 2024Updated last year
- DeepSeek Coder: Let the Code Write Itself☆22,938Nov 11, 2025Updated 4 months ago
- ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI☆31,921Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆73,479Updated this week
- 🙌 OpenHands: AI-Driven Development☆69,254Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆54,096Updated this week
- Large Language Model Text Generation Inference☆10,803Jan 8, 2026Updated 2 months ago
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆6,531Nov 11, 2025Updated 4 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,402Oct 19, 2024Updated last year
- A framework for the evaluation of autoregressive code generation language models.☆1,021Jul 22, 2025Updated 7 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,261Mar 3, 2026Updated 2 weeks ago
- Official inference library for Mistral models☆10,709Feb 26, 2026Updated 3 weeks ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆39,597Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆165,557Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,228Mar 6, 2026Updated 2 weeks ago
- Tools for merging pretrained large language models.☆6,867Updated this week
- A programming framework for agentic AI☆55,559Mar 11, 2026Updated last week
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆4,080Apr 24, 2024Updated last year
- A series of large language models trained from scratch by developers @01-ai☆7,842Nov 27, 2024Updated last year
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆46,408Updated this week
- aider is AI pair programming in your terminal☆41,939Mar 9, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆32,853Updated this week
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,141Mar 8, 2026Updated last week
- The official Meta Llama 3 GitHub site☆29,279Jan 26, 2025Updated last year
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆853Jul 6, 2024Updated last year
- tiny vision language model☆9,427Nov 14, 2025Updated 4 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆989Jul 23, 2024Updated last year
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.☆16,008Feb 3, 2026Updated last month
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,698Oct 2, 2025Updated 5 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,911May 3, 2024Updated last year