multiplexerai / Namespace-RAG
☆13Updated last year
Alternatives and similar repositories for Namespace-RAG
Users that are interested in Namespace-RAG are comparing it to the libraries listed below
- ☆25Updated last year
- ☆40Updated last year
- Gradio-based tool to run open-source LLM models directly from Hugging Face☆96Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- GRDN.AI app for garden optimization☆69Updated 2 months ago
- An API for VoiceCraft.☆25Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆112Updated last year
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- Embed anything.☆27Updated last year
- ☆135Updated last month
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- Experimental LLM Inference UX to aid in creative writing☆128Updated last year
- Local LLaMAs/Models in VSCode☆54Updated 2 years ago
- run ollama & gguf easily with a single command☆52Updated last year
- All the world is a play, we are but actors in it.☆49Updated 6 months ago
- ☆50Updated last year
- ☆28Updated 9 months ago
- LIVA - Local Intelligent Voice Assistant☆61Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Updated last year
- 100% Local Document deep search with LLMs☆26Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- Client-side toolkit for using large language models, including self-hosted ones☆115Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models☆180Updated last year
- ☆17Updated last year
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆123Updated last year
- Let's create synthetic textbooks together :)☆76Updated 2 years ago
- Python package wrapping llama.cpp for on-device LLM inference☆100Updated 3 months ago
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping☆31Updated 10 months ago
- Complex RAG backend☆29Updated last year