bigcode-project / starcoder2
Home of StarCoder2!
☆1,775Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for starcoder2
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆1,972Updated last week
- official repository of aiXcoder-7B Code Large Language Model☆2,220Updated 2 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,591Updated 6 months ago
- AIOS: LLM Agent Operating System☆3,390Updated this week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,349Updated 3 months ago
- DeepSeek Coder: Let the Code Write Itself☆6,801Updated 5 months ago
- DeepSeek LLM: Let there be answers☆1,438Updated 9 months ago
- Training LLMs with QLoRA + FSDP☆1,418Updated this week
- Set of tools to assess and improve LLM security.☆2,697Updated last week
- This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are t…☆2,457Updated this week
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-b…☆2,711Updated last week
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,326Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.☆801Updated this week
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,505Updated 2 months ago
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆3,417Updated last month
- A framework for prompt tuning using Intent-based Prompt Calibration☆2,171Updated this week
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆810Updated 4 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆3,574Updated last month
- Tools for merging pretrained large language models.☆4,788Updated this week
- ☆1,878Updated last week
- PyTorch native finetuning library☆4,267Updated this week
- [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?☆1,943Updated last week
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,293Updated 7 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,504Updated 4 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,608Updated 2 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,246Updated 6 months ago
- Mora: More like Sora for Generalist Video Generation☆1,513Updated last month
- Open weights LLM from Google DeepMind.☆2,459Updated last week
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,206Updated 6 months ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆971Updated 8 months ago