LucienShui / huggingface-vscode-endpoint-server
starcoder server for huggingface-vscdoe custom endpoint
β171Updated last year
Alternatives and similar repositories for huggingface-vscode-endpoint-server:
Users that are interested in huggingface-vscode-endpoint-server are comparing it to the libraries listed below
- An endpoint server for efficiently serving quantized open-source LLMs for code.β54Updated last year
- Fine-tune SantaCoder for Code/Text Generation.β190Updated last year
- C++ implementation for π«StarCoderβ453Updated last year
- Visual Studio Code extension for WizardCoderβ147Updated last year
- 4 bits quantization of SantaCoder using GPTQβ51Updated last year
- Extension for using alternative GitHub Copilot (StarCoder API) in VSCodeβ100Updated 11 months ago
- A command-line interface to generate textual and conversational datasets with LLMs.β293Updated last year
- Falcon LLM ggml framework with CPU and GPU supportβ246Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hubβ158Updated last year
- Instruct-tuning LLaMA on consumer hardwareβ66Updated 2 years ago
- β84Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.β122Updated last year
- β140Updated last year
- Ongoing research training transformer models at scaleβ383Updated 7 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge promptsβ111Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.β91Updated last year
- β73Updated last year
- Local LLM ReAct Agent with Guidanceβ157Updated last year
- Merge Transformers language models by use of gradient parameters.β205Updated 7 months ago
- LLaMa retrieval plugin script using OpenAI's retrieval pluginβ324Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRAβ123Updated last year
- CodeGen2 models for program synthesisβ274Updated last year
- LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter promptβ63Updated last year
- A joint community effort to create one central leaderboard for LLMs.β293Updated 7 months ago
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)β273Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMsβ130Updated 8 months ago
- Code Assistance/ Developer Productivity suite of Modelsβ125Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMsβ77Updated 11 months ago
- β268Updated last year
- Tune MPTsβ84Updated last year