anastasiosyal / phi4-multimodal-instruct-server
Phi4 Multimodal Instruct - OpenAI endpoint and Docker Image for self-hosting
☆40Updated 11 months ago
Alternatives and similar repositories for phi4-multimodal-instruct-server
Users who are interested in phi4-multimodal-instruct-server are comparing it to the libraries listed below.
- Running Microsoft's BitNet via Electron, React & Astro☆52Updated 4 months ago
- ☆209Updated last month
- ☆135Updated last month
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- ☆109Updated 5 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆28Updated 9 months ago
- Service for testing out the new Qwen2.5 omni model☆63Updated 9 months ago
- Something similar to Apple Intelligence?☆60Updated last year
- ☆30Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆103Updated 5 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆35Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆112Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated 3 weeks ago
- Mixture-of-Ollamas☆30Updated last year
- Docker compose to run vLLM on Windows☆114Updated 2 years ago
- Realtime TTS reading of large text files by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- Automatically quantize GGUF models☆219Updated last month
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Updated last year
- ☆17Updated last year
- ☆29Updated 9 months ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆33Updated last year
- Efficient computer use agent powered by Meta Llama 4 Maverick☆45Updated 9 months ago
- The heart of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆108Updated 6 months ago
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆103Updated last year
- Own your AI, search the web with it🌐😎☆94Updated last year