the-crypt-keeper / ggml-downloader
Simple, fast, parallel Hugging Face GGML model downloader written in Python
☆24 Updated last year
Alternatives and similar repositories for ggml-downloader
Users who are interested in ggml-downloader are comparing it to the repositories listed below.
- Local LLM inference & management server with built-in OpenAI API ☆31 Updated last year
- ☆31 Updated last year
- Yet Another (LLM) Web UI, made with Gemini ☆12 Updated 5 months ago
- Experimental sampler to make LLMs more creative ☆31 Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. ☆26 Updated 3 months ago
- ☆21 Updated 3 months ago
- A QT GUI for large language models ☆35 Updated last year
- ☆16 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆35 Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 Updated 2 years ago
- A repository to store helpful information and emerging insights in regard to LLMs ☆20 Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- ☆24 Updated 5 months ago
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format. ☆22 Updated last year
- run ollama & gguf easily with a single command ☆51 Updated last year
- ☆55 Updated 2 years ago
- Run embedding models using ONNX ☆34 Updated last year
- Complex RAG backend ☆28 Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 Updated last year
- Plug n Play GBNF Compiler for llama.cpp ☆25 Updated last year
- The heart of The Pulsar App, fast, secure and shared inference with modern UI ☆56 Updated 6 months ago
- Embedding models from Jina AI ☆60 Updated last year
- an auto-sleeping and -waking framework around llama.cpp ☆12 Updated 4 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 7 months ago
- ☆16 Updated 2 years ago
- Generate visual podcasts about novels using open source models ☆25 Updated 2 years ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline. ☆45 Updated last year
- jQuery, React and Streamlit applications written by LLMs ☆16 Updated last year
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible ☆14 Updated last year