multiplexerai / Namespace-RAG
☆13 Updated last year
Alternatives and similar repositories for Namespace-RAG:
Users who are interested in Namespace-RAG are comparing it to the libraries listed below:
- ☆25 Updated last year
- ☆39 Updated last year
- Local LLM inference & management server with built-in OpenAI API ☆31 Updated last year
- ☆17 Updated 4 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆22 Updated 10 months ago
- Local LLaMAs/Models in VSCode ☆53 Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆71 Updated 8 months ago
- ☆13 Updated last week
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆47 Updated 7 months ago
- run ollama & gguf easily with a single command ☆50 Updated 11 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 Updated last year
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma ☆34 Updated 7 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 Updated 7 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights ☆49 Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. ☆40 Updated 2 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models … ☆32 Updated 9 months ago
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆91 Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference ☆24 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 6 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆30 Updated this week
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping ☆27 Updated last month
- GRDN.AI app for garden optimization ☆70 Updated last year
- ☆28 Updated 7 months ago
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic. ☆38 Updated 3 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆61 Updated 8 months ago
- Complex RAG backend ☆28 Updated last year
- ☆16 Updated this week
- Easily create LLM automation/agent workflows ☆59 Updated last year