bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆443Updated last year
Related projects: ⓘ
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆326Updated last year
- ggml implementation of BERT☆460Updated 6 months ago
- Falcon LLM ggml framework with CPU and GPU support☆245Updated 7 months ago
- Self-evaluating interview for AI coders☆515Updated last week
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 weeks ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆338Updated 11 months ago
- starcoder server for huggingface-vscdoe custom endpoint☆166Updated 10 months ago
- LLM-based code completion engine☆172Updated last year
- Supercharge Open-Source AI Models☆348Updated last year
- C++ implementation for BLOOM☆813Updated last year
- ☆533Updated 9 months ago
- Customizable implementation of the self-instruct paper.☆1,004Updated 6 months ago
- SoTA Transformers with C-backend for fast inference on your CPU.☆311Updated 9 months ago
- Tune any FALCON in 4-bit☆469Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆109Updated last year
- Run Alpaca LLM in LangChain☆218Updated 9 months ago
- ☆406Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆810Updated last year
- ☆453Updated 11 months ago
- Visual Studio Code extension for WizardCoder☆143Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- ☆1,411Updated last year
- Finetuning Large Language Models on One Consumer GPU in Under 4 Bits☆696Updated 3 months ago
- Run inference on replit-3B code instruct model using CPU☆155Updated last year
- A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…☆594Updated last year
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆408Updated last year
- Web UI for ExLlamaV2☆420Updated 2 weeks ago
- ☆275Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆139Updated 11 months ago