TabbyML / registry-tabbyLinks
☆39Updated 6 months ago
Alternatives and similar repositories for registry-tabby
Users that are interested in registry-tabby are comparing it to the libraries listed below
Sorting:
- ☆89Updated last month
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated last year
- LLM powered development for IntelliJ☆84Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 10 months ago
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆22Updated 2 years ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆47Updated last month
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆41Updated 5 months ago
- ☆20Updated last year
- Enhancing Translation with RAG-Powered Large Language Models☆87Updated 3 weeks ago
- ☆166Updated last year
- AirLLM 70B inference with single 4GB GPU☆14Updated 5 months ago
- Transformer GPU VRAM estimator☆67Updated last year
- ☆34Updated 8 months ago
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Updated 10 months ago
- A data visualisation of a 100 responses when asking local LLMs to imagine a random person.☆24Updated last year
- Code Assistance/ Developer Productivity suite of Models☆125Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago
- CI for ggml and related projects☆31Updated 2 months ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆53Updated 11 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 6 months ago
- Visual Studio Code extension for WizardCoder☆149Updated 2 years ago
- Loader extension for tabbyAPI in SillyTavern☆26Updated 5 months ago
- My personal fork of koboldcpp where I hack in experimental samplers.☆44Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- LLM-based code completion engine