multiplexerai / Namespace-RAG
☆13Updated last year
Alternatives and similar repositories for Namespace-RAG:
Users that are interested in Namespace-RAG are comparing it to the libraries listed below
- ☆25Updated last year
- ☆39Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 9 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- An API for VoiceCraft.☆25Updated 9 months ago
- ☆17Updated 4 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 7 months ago
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆24Updated 5 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 8 months ago
- ☆47Updated 5 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆28Updated 3 months ago
- All the world is a play, we are but actors in it.☆49Updated this week
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 6 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Embed anything.☆29Updated 10 months ago
- Use smol agents to do research and then update csv coumns with its findings.☆37Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated 9 months ago
- ☆22Updated last year
- ☆16Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 6 months ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF☆32Updated last year
- BH hackathon☆14Updated last year
- ☆20Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 9 months ago
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆49Updated 11 months ago
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping☆27Updated 2 weeks ago