monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆49Updated 9 months ago
Alternatives and similar repositories for auto-ollama:
Users that are interested in auto-ollama are comparing it to the libraries listed below
- ☆24Updated last month
- ☆124Updated this week
- Easily view and modify JSON datasets for large language models☆71Updated last week
- ☆16Updated 2 months ago
- entropix style sampling + GUI☆25Updated 4 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆70Updated 6 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 8 months ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆17Updated 9 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆73Updated 3 months ago
- Let's create synthetic textbooks together :)☆73Updated last year
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆64Updated 4 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 9 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 5 months ago
- Embed anything.☆29Updated 9 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated 9 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆38Updated 6 months ago
- ☆111Updated 2 months ago
- Text generation in Python, as easy as possible☆55Updated this week
- ☆27Updated 6 months ago
- Complex RAG backend☆28Updated 11 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated 9 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆53Updated 2 weeks ago
- automatically quant GGUF models☆160Updated this week
- A simple experiment on letting two local LLM have a conversation about anything!☆106Updated 8 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆37Updated 2 weeks ago
- ☆53Updated 9 months ago
- ☆65Updated 9 months ago
- PyPlexitas is an open-source Python CLI alternative to Perplexity AI, designed to perform web searches, scrape content, generate embeddin…☆35Updated 9 months ago
- ☆28Updated 5 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 8 months ago