TabbyML / registry-tabbyLinks
☆33Updated 2 weeks ago
Alternatives and similar repositories for registry-tabby
Users that are interested in registry-tabby are comparing it to the libraries listed below
Sorting:
- LLM powered development for IntelliJ☆81Updated last year
- Self-hosted LLM chatbot arena, with yourself as the only judge☆41Updated last year
- starcoder server for huggingface-vscdoe custom endpoint☆172Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 6 months ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆48Updated 5 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- ☆21Updated 4 months ago
- Web tool to count LLM tokens (GPT, Claude, Llama, ...)☆33Updated last week
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 9 months ago
- Visual Studio Code extension for WizardCoder☆148Updated last year
- Thin wrapper around GGML to make life easier☆34Updated this week
- asynchronous/distributed speculative evaluation for llama3☆39Updated 9 months ago
- ☆90Updated 2 weeks ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆148Updated last month
- llama.cpp to PyTorch Converter☆33Updated last year
- ☆53Updated last year
- Ask shortgpt for instant and concise answers☆13Updated 2 years ago
- Extension for using alternative GitHub Copilot (StarCoder API) in VSCode☆100Updated last year
- Inference Llama/Llama2/Llama3 Modes in NumPy☆21Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆61Updated 4 months ago
- ☆18Updated 3 months ago
- See how HTTPX, Requests, and AIOHTTP libraries compare for sending network requests and find out which one may fit your case better.☆18Updated last month
- AirLLM 70B inference with single 4GB GPU☆13Updated 9 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆153Updated 3 weeks ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 10 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆40Updated this week
- 🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion …☆404Updated 8 months ago
- ggml implementation of embedding models including SentenceTransformer and BGE☆58Updated last year
- Scrape details about Code Interpreter to track any changes☆67Updated last month