TabbyML / registry-tabby
☆27Updated this week
Related projects ⓘ
Alternatives and complementary repositories for registry-tabby
- LLM powered development for IntelliJ☆69Updated 7 months ago
- multispy is a lsp client library in Python intended to be used to build applications around language servers.☆59Updated last month
- ggml implementation of BERT Embedding☆24Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆10Updated 10 months ago
- HTTP proxy for on-demand model loading with llama.cpp (or other OpenAI compatible backends)☆41Updated this week
- ☆26Updated this week
- Rust executable for Refact Agent, it lives inside your IDE and keeps AST and VecDB indexes up to date, offers agentic tools for an AI mod…☆47Updated this week
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated 9 months ago
- llama.cpp to PyTorch Converter☆26Updated 7 months ago
- CI for ggml and related projects☆20Updated this week
- Tensor library for machine learning☆20Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆54Updated last year
- Port of Facebook's LLaMA model in C/C++☆20Updated last year
- Inference Llama/Llama2 Modes in NumPy☆20Updated last year
- LLM inference in C/C++☆11Updated 3 months ago
- Refact AI: Open-source AI Code assistant with autocompletion, chat, refactoring and more for VS Code☆77Updated this week
- AirLLM 70B inference with single 4GB GPU☆12Updated 3 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆23Updated this week
- Extension for using alternative GitHub Copilot (StarCoder API) in VSCode☆99Updated 7 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆41Updated last month
- LLMtranslator translates and generates text in multiple languages.☆41Updated 6 months ago
- Ask shortgpt for instant and concise answers☆13Updated last year
- Shire Lang Spring/Java Demo project☆13Updated this week
- GPT-4 Level Conversational QA Trained In a Few Hours☆55Updated 3 months ago
- ☆75Updated this week
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- ☆53Updated 5 months ago
- An experimental Rust library for general code file relationship analysis. Based on tree-sitter and git analysis.☆37Updated this week
- BlinkDL's RWKV-v4 running in the browser☆46Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year