monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆50Updated 11 months ago
Alternatives and similar repositories for auto-ollama:
Users that are interested in auto-ollama are comparing it to the libraries listed below
- ☆24Updated 3 months ago
- Complex RAG backend☆28Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- ☆66Updated 11 months ago
- Text generation in Python, as easy as possible☆60Updated this week
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 10 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 8 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- entropix style sampling + GUI☆26Updated 6 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 11 months ago
- A repository to store helpful information and emerging insights in regard to LLMs☆20Updated last year
- ☆130Updated last week
- Locally running LLM with internet access☆94Updated 3 weeks ago
- ☆112Updated 4 months ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- All the world is a play, we are but actors in it.☆49Updated this week
- ☆17Updated 4 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated 10 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 9 months ago
- Easily view and modify JSON datasets for large language models☆75Updated 2 months ago
- automatically quant GGUF models☆170Updated last week
- Function Calling Benchmark & Testing☆87Updated 9 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 5 months ago
- Embed anything.☆29Updated 11 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 8 months ago
- ☆53Updated 11 months ago