phildougherty / qwen2.5-VL-inference-openaiLinks
Inference service for Qwen2.5-VL-7b model
☆209Updated 10 months ago
Alternatives and similar repositories for qwen2.5-VL-inference-openai
Users that are interested in qwen2.5-VL-inference-openai are comparing it to the libraries listed below
Sorting:
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆245Updated last year
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p…☆153Updated 10 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 10 months ago
- Service for testing out the new Qwen2.5 omni model☆62Updated 9 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆127Updated 4 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- Experimenting with AutoGen to see if an entire book can be written with AI agents☆344Updated 10 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆258Updated 3 months ago
- ☆178Updated 5 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆281Updated 9 months ago
- Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic research tool to reason , cr…☆411Updated 8 months ago
- Generate Web Pages and Components with text prompts, with Local Models. (or Cloud Models, if you want)☆400Updated 3 weeks ago
- A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp☆368Updated last year
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser☆115Updated 6 months ago
- The MCP Code Executor is an MCP server that allows LLMs to execute Python code within a specified Conda environment.☆213Updated 8 months ago
- List of curated use cases built using Sesame's CSM 1B☆73Updated 8 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆350Updated last year
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆334Updated 10 months ago
- Pocket Flow Tutorial Project: Build Cursor with Cursor☆226Updated 10 months ago
- Docker compose to run vLLM on Windows☆114Updated 2 years ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated last year
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆210Updated 8 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆340Updated 8 months ago
- PocketFlow's node-based workflow structure, with Manus' agents and tools!☆292Updated 3 months ago
- Jina DeepSearch UI☆127Updated 5 months ago
- ☆205Updated 4 months ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆111Updated 7 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆281Updated 3 weeks ago
- MCP Server to Use HuggingFace spaces, easy configuration and Claude Desktop mode.☆381Updated 7 months ago
- ☆200Updated 10 months ago