multiplexerai / Namespace-RAGLinks
☆13Updated last year
Alternatives and similar repositories for Namespace-RAG
Users that are interested in Namespace-RAG are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- ☆39Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆94Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 10 months ago
- All the world is a play, we are but actors in it.☆50Updated 2 weeks ago
- Experimental LLM Inference UX to aid in creative writing☆119Updated 7 months ago
- Embed anything.☆28Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated 9 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆39Updated 3 weeks ago
- Let's create synthetic textbooks together :)☆75Updated last year
- ☆38Updated last year
- ☆49Updated last year
- LIVA - Local Intelligent Voice Assistant☆61Updated 11 months ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆120Updated 8 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 10 months ago
- GPT-2 small trained on phi-like data☆67Updated last year
- run ollama & gguf easily with a single command☆52Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆52Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆80Updated 3 weeks ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 11 months ago
- LLaVA server (llama.cpp).☆181Updated last year
- GRDN.AI app for garden optimization☆70Updated last year
- AutoNL - Natural Language Automation tool☆86Updated last year
- Local LLaMAs/Models in VSCode☆53Updated 2 years ago
- A fast batching API to serve LLM models☆185Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆63Updated 11 months ago