multiplexerai / Namespace-RAGLinks
☆13Updated last year
Alternatives and similar repositories for Namespace-RAG
Users that are interested in Namespace-RAG are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- ☆39Updated last year
- An API for VoiceCraft.☆25Updated 11 months ago
- ☆17Updated 6 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated 8 months ago
- run ollama & gguf easily with a single command☆51Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆31Updated this week
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- Demo of an "always-on" AI assistant.☆24Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated 2 years ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆72Updated this week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 8 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 11 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 9 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- Complex RAG backend☆28Updated last year
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙☆35Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆93Updated 11 months ago
- Embed anything.☆28Updated last year
- ☆50Updated 4 months ago
- ☆29Updated 8 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- All the world is a play, we are but actors in it.☆50Updated this week
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆51Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆41Updated this week
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping☆28Updated 2 months ago