bigcode-project / starcoder2
Home of StarCoder2!
☆1,899Updated last year
Alternatives and similar repositories for starcoder2:
Users that are interested in starcoder2 are comparing it to the libraries listed below
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,017Updated 5 months ago
- official repository of aiXcoder-7B Code Large Language Model☆2,258Updated 3 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,643Updated 11 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆981Updated 9 months ago
- SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?☆2,862Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,033Updated last week
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,249Updated last week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,613Updated last week
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆843Updated 9 months ago
- Granite Code Models: A Family of Open Foundation Models for Code Intelligence☆1,208Updated 5 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,042Updated last month
- ☆1,417Updated last month
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,817Updated 5 months ago
- Modeling, training, eval, and inference code for OLMo☆5,519Updated this week
- Training LLMs with QLoRA + FSDP☆1,472Updated 5 months ago
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,187Updated 2 weeks ago
- ☆2,915Updated 7 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,398Updated 4 months ago
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,445Updated 3 weeks ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,362Updated this week
- ☆2,729Updated this week
- The official PyTorch implementation of Google's Gemma models☆5,422Updated last month
- ☆1,469Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,432Updated 11 months ago
- Code for the paper "Evaluating Large Language Models Trained on Code"☆2,704Updated 3 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,646Updated 7 months ago
- A lightweight framework for building LLM-based agents☆2,104Updated last month
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆2,921Updated last month
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,308Updated last year
- Gemma open-weight LLM library, from Google DeepMind☆3,201Updated last week