LucienShui / huggingface-vscode-endpoint-server

StarCoder server for huggingface-vscode custom endpoint

☆175

Alternatives and similar repositories for huggingface-vscode-endpoint-server

Users who are interested in huggingface-vscode-endpoint-server are comparing it to the libraries listed below.

mzbac / wizardCoder-vsc
Visual Studio Code extension for WizardCoder
☆148 Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆455 Updated 2 years ago
nuance1979 / llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆130 Updated 2 years ago
wangcx18 / llm-vscode-inference-server
An endpoint server for efficiently serving quantized open-source LLMs for code.
☆57 Updated 2 years ago
mzbac / AutoGPTQ-API
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆88 Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆50 Updated 2 years ago
Lisoveliy / StarCoderEx
Extension for using alternative GitHub Copilot (StarCoder API) in VSCode
☆100 Updated last year
loubnabnl / santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
☆193 Updated 2 years ago
the-crypt-keeper / can-ai-code
Self-evaluating interview for AI coders
☆596 Updated 4 months ago
paolorechia / learn-langchain
☆275 Updated 2 years ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆108 Updated 2 years ago
iaalm / llama-api-server
An OpenAI API-compatible REST server for llama.
☆208 Updated 8 months ago
ausboss / Local-LLM-Langchain
Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobabooga and Kobol…
☆212 Updated 2 years ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247 Updated last year
ChuloAI / BrainChulo
Harnessing the Memory Power of the Camelids
☆147 Updated 2 years ago
bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆392 Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73 Updated 2 years ago
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆160 Updated 2 years ago
Nuggt-dev / Nuggt
An Autonomous LLM Agent that runs on WizardCoder-15B
☆333 Updated last year
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323 Updated 2 years ago
togethercomputer / redpajama.cpp
Extends the original llama.cpp repo to support the RedPajama model.
☆118 Updated last year
QuangBK / localLLM_guidance
Local LLM ReAct Agent with Guidance
☆158 Updated 2 years ago
c0sogi / llama-api
An OpenAI-like LLaMA inference API
☆113 Updated 2 years ago
lhenault / simpleAI
An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.
☆332 Updated last year
petals-infra / chat.petals.dev
💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
☆315 Updated last year
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆305 Updated last year
OpenAccess-AI-Collective / ggml-webui
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
☆37 Updated 2 years ago
Venthe / vscode-fauxpilot
☆203 Updated last year
1b5d / langchain-llm-api
☆39 Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated 2 years ago