LLM powered development for VSCode
☆1,315 · May 26, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for llm-vscode
Users who are interested in llm-vscode are comparing it to the libraries listed below.
- StarCoder server for huggingface-vscode custom endpoint ☆179 · Nov 18, 2023 · Updated 2 years ago
- Home of StarCoder: fine-tuning & inference! ☆7,506 · Feb 27, 2024 · Updated 2 years ago
- Extension for using an alternative to GitHub Copilot (StarCoder API) in VSCode ☆98 · Apr 2, 2024 · Updated 2 years ago
- Large Language Model Text Generation Inference ☆10,859 · Mar 21, 2026 · Updated 2 months ago
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ☆9,483 · Jun 7, 2025 · Updated last year
- ☆139 · Nov 5, 2023 · Updated 2 years ago
- The open source codebase powering HuggingChat ☆10,754 · Jun 5, 2026 · Updated last week
- ⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI ☆33,646 · Updated this week
- C++ implementation for 💫StarCoder ☆458 · Sep 9, 2023 · Updated 2 years ago
- Visual Studio Code extension for WizardCoder ☆148 · Aug 1, 2023 · Updated 2 years ago
- Inference code for CodeLlama models ☆16,311 · Aug 12, 2024 · Updated last year
- Simple, safe way to store and distribute tensors ☆3,764 · Jun 4, 2026 · Updated last week
- LlamaIndex is the leading document agent and OCR platform ☆50,073 · Updated this week
- Type less, code more: Cody is an AI code assistant that uses advanced search and codebase context to help you write and fix code. ☆3,800 · Aug 1, 2025 · Updated 10 months ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆3,409 · Jun 3, 2026 · Updated last week
- Accessible large language models via k-bit quantization for PyTorch. ☆8,263 · Updated this week
- Go ahead and axolotl questions ☆12,032 · Updated this week
- Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private. ☆47,283 · Jun 2, 2026 · Updated last week
- A blazing fast inference solution for text embeddings models ☆4,861 · May 26, 2026 · Updated 2 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆10,179 · Sep 7, 2024 · Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,470 · May 1, 2026 · Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆82,482 · Updated this week
- Python bindings for llama.cpp ☆10,388 · Updated this week
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,886 · Jan 28, 2024 · Updated 2 years ago
- An easy-to-use LLM quantization package with user-friendly APIs, based on GPTQ algorithm. ☆5,068 · Apr 11, 2025 · Updated last year
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ ☆4,978 · Jun 6, 2026 · Updated last week
- Minimalist ML framework for Rust ☆20,426 · Jun 7, 2026 · Updated last week
- Universal LLM Deployment Engine with ML Compilation ☆22,792 · May 11, 2026 · Updated last month
- A guidance language for controlling large language models. ☆21,488 · May 21, 2026 · Updated 3 weeks ago
- Tools for merging pretrained large language models. ☆7,126 · May 6, 2026 · Updated last month
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,895 · Apr 13, 2026 · Updated 2 months ago
- 🤗 AutoTrain Advanced ☆4,579 · May 26, 2026 · Updated 2 weeks ago
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆5,175 · Jun 2, 2026 · Updated last week
- Tensor library for machine learning ☆14,804 · Updated this week
- StableLM: Stability AI Language Models ☆15,700 · Apr 8, 2024 · Updated 2 years ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆50,129 · Updated this week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,079 · Jul 1, 2025 · Updated 11 months ago
- Self-hosted AI coding assistant ☆33,583 · Mar 2, 2026 · Updated 3 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,414 · Updated this week