multiplexerai / Namespace-RAG
☆13Updated last year
Alternatives and similar repositories for Namespace-RAG:
Users who are interested in Namespace-RAG are comparing it to the libraries listed below.
- ☆24Updated last year
- ☆39Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 8 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Complex RAG backend☆28Updated 11 months ago
- Embed anything.☆29Updated 9 months ago
- Gradio-based tool to run open-source LLM models directly from Hugging Face☆91Updated 8 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆29Updated 5 months ago
- ☆16Updated last year
- ☆46Updated 4 months ago
- Simple LLM inference server☆20Updated 9 months ago
- Local LLaMAs/Models in VSCode☆53Updated last year
- Run Ollama & GGUF easily with a single command☆49Updated 10 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆21Updated 4 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 8 months ago
- All the world is a play, we are but actors in it.☆47Updated this week
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated 5 months ago
- An API for VoiceCraft.☆25Updated 8 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 6 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 6 months ago
- ☆17Updated 3 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆14Updated 2 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 7 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 5 months ago
- GPT-2 small trained on phi-like data☆65Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- LIVA - Local Intelligent Voice Assistant☆61Updated 6 months ago