bigcode-project / starcoder.cppLinks

C++ implementation for 💫StarCoder

☆459

Alternatives and similar repositories for starcoder.cpp

Users that are interested in starcoder.cpp are comparing it to the libraries listed below

Sorting:

cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆249Updated 2 years ago
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323Updated 2 years ago
skeskinen / bert.cpp
ggml implementation of BERT
☆498Updated last year
the-crypt-keeper / can-ai-code
Self-evaluating interview for AI coders
☆600Updated 7 months ago
ggml-org / p1
LLM-based code completion engine
☆190Updated last year
NouamaneTazi / bloomz.cpp
C++ implementation for BLOOM
☆809Updated 2 years ago
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118Updated last year
LucienShui / huggingface-vscode-endpoint-server
starcoder server for huggingface-vscdoe custom endpoint
☆179Updated 2 years ago
mzbac / wizardCoder-vsc
Visual Studio Code extension for WizardCoder
☆148Updated 2 years ago
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆463Updated 2 years ago
Nuggt-dev / Nuggt
An Autonomous LLM Agent that runs on Wizcoder-15B
☆333Updated last year
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,049Updated last year
iaalm / llama-api-server
A OpenAI API compatible REST server for llama.
☆208Updated 11 months ago
bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆395Updated last year
catid / supercharger
Supercharge Open-Source AI Models
☆349Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆535Updated 2 years ago
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆311Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412Updated 2 years ago
keldenl / gpt-llama.cpp
A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…
☆597Updated 2 years ago
xNul / code-llama-for-vscode
Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
☆568Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆124Updated 2 years ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆109Updated 2 years ago
pointnetwork / point-alpaca
☆404Updated 2 years ago
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆734Updated last year
abacaj / mpt-30B-inference
Run inference on MPT-30B using CPU
☆576Updated 2 years ago
petals-infra / chat.petals.dev
💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
☆316Updated last year
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆823Updated 2 years ago
abacaj / replit-3B-inference
Run inference on replit-3B code instruct model using CPU
☆160Updated 2 years ago
thomasantony / llamacpp-python
Python bindings for llama.cpp
☆198Updated 2 years ago
ausboss / Local-LLM-Langchain
Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobol…
☆213Updated 2 years ago