Home of StarCoder2!
☆2,041Mar 21, 2024Updated last year
Alternatives and similar repositories for starcoder2
Users that are interested in starcoder2 are comparing it to the libraries listed below
Sorting:
- Home of StarCoder: fine-tuning & inference!☆7,530Feb 27, 2024Updated 2 years ago
- official repository of aiXcoder-7B Code Large Language Model☆2,270Jul 9, 2025Updated 7 months ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆126Apr 11, 2024Updated last year
- Inference code for CodeLlama models☆16,346Aug 12, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,476Jun 7, 2025Updated 8 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,700May 7, 2024Updated last year
- Modeling, training, eval, and inference code for OLMo☆6,326Nov 24, 2025Updated 3 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,076Nov 1, 2024Updated last year
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,921Nov 25, 2024Updated last year
- DeepSeek Coder: Let the Code Write Itself☆22,833Nov 11, 2025Updated 3 months ago
- ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI☆31,532Updated this week
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆323Feb 24, 2025Updated last year
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆4,072Apr 24, 2024Updated last year
- Large World Model -- Modeling Text and Video with Millions Context☆7,399Oct 19, 2024Updated last year
- Official inference library for Mistral models☆10,683Nov 21, 2025Updated 3 months ago
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆6,466Nov 11, 2025Updated 3 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,182Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- Large Language Model Text Generation Inference☆10,774Jan 8, 2026Updated last month
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Jul 23, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,220Nov 3, 2025Updated 3 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,843Nov 27, 2024Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆52,724Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- 🙌 OpenHands: AI-Driven Development☆68,154Updated this week
- Tools for merging pretrained large language models.☆6,814Jan 26, 2026Updated last month
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆44,662Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,381Updated this week
- tiny vision language model☆9,364Nov 14, 2025Updated 3 months ago
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.☆15,723Feb 3, 2026Updated 3 weeks ago
- A programming framework for agentic AI☆54,683Jan 22, 2026Updated last month
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆851Jul 6, 2024Updated last year
- aider is AI pair programming in your terminal☆40,851Feb 19, 2026Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,891May 3, 2024Updated last year
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,349Feb 16, 2026Updated last week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…☆18,517Feb 17, 2026Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,995Sep 25, 2024Updated last year
- The official Meta Llama 3 GitHub site☆29,265Jan 26, 2025Updated last year
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆163,045Feb 21, 2026Updated last week