OpenAccess-AI-Collective / servereless-runpod-ggmlLinks
☆55Updated 2 years ago
Alternatives and similar repositories for servereless-runpod-ggml
Users that are interested in servereless-runpod-ggml are comparing it to the libraries listed below
Sorting:
- ☆52Updated last year
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMs☆2Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆37Updated 2 years ago
- Easily create LLM automation/agent workflows☆59Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- Client-side toolkit for using large language models, including where self-hosted☆111Updated 7 months ago
- ☆115Updated 6 months ago
- ☆31Updated last year
- ☆44Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 8 months ago
- TheBloke's Dockerfiles☆305Updated last year
- An OpenAI-like LLaMA inference API☆112Updated last year
- ☆49Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- Locally running LLM with internet access☆95Updated last week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆116Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- Experimental LLM Inference UX to aid in creative writing☆114Updated 6 months ago
- Complex RAG backend☆28Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Updated last year
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆34Updated 9 months ago
- A Qt GUI for large language models☆43Updated last year
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆55Updated last year
- A collection of character cards for use in AI Roleplaying☆84Updated 2 years ago
- ☆40Updated last month
- run ollama & gguf easily with a single command☆52Updated last year
- Run language models on consumer hardware.☆25Updated last year
- ☆157Updated 11 months ago
- A fast batching API to serve LLM models☆183Updated last year
- A langchain app to visualise a debate using Tree-of-Thought reasoning☆60Updated last year