the-crypt-keeper / ggml-downloaderLinks
Simple, Fast, Parallel Huggingface GGML model downloader written in python
☆24Updated last year
Alternatives and similar repositories for ggml-downloader
Users that are interested in ggml-downloader are comparing it to the libraries listed below
Sorting:
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- Simple LLM inference server☆20Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- run ollama & gguf easily with a single command☆52Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆57Updated 7 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 6 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆16Updated 10 months ago
- A QT GUI for large language models☆37Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- ☆49Updated this week
- Python package wrapping llama.cpp for on-device LLM inference☆75Updated this week
- ☆28Updated 10 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆27Updated 8 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆36Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Embedding models from Jina AI☆61Updated last year
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆22Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 11 months ago
- A Qt GUI for large language models☆43Updated last year
- ☆22Updated 3 months ago
- ☆23Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago