multiplexerai / Namespace-RAG
☆13Updated 11 months ago
Alternatives and similar repositories for Namespace-RAG:
Users that are interested in Namespace-RAG are comparing it to the libraries listed below
- ☆24Updated 11 months ago
- ☆39Updated 11 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆19Updated 2 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- Text generation in Python, as easy as possible☆51Updated this week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆44Updated 4 months ago
- run ollama & gguf easily with a single command☆49Updated 8 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 5 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf…☆33Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆69Updated 4 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated 3 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated 10 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 7 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆101Updated 6 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆26Updated 3 months ago
- 100% Local Document deep search with LLMs☆25Updated 4 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 5 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆37Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆63Updated 2 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 4 months ago
- ☆16Updated last month
- Easy to use, High Performant Knowledge Distillation for LLMs☆40Updated 2 weeks ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- Simple LLM inference server☆20Updated 7 months ago
- An API for VoiceCraft.☆26Updated 7 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated 10 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆30Updated 6 months ago
- Embed anything.☆28Updated 8 months ago
- LIVA - Local Intelligent Voice Assistant☆61Updated 5 months ago
- Branch Out Your Conversations☆27Updated 2 weeks ago