phildougherty / qwen2.5-VL-inference-openaiView external linksLinks
Inference service for Qwen2.5-VL-7b model
☆208Mar 24, 2025Updated 10 months ago
Alternatives and similar repositories for qwen2.5-VL-inference-openai
Users that are interested in qwen2.5-VL-inference-openai are comparing it to the libraries listed below
Sorting:
- 使用FastAPI+vLLM部署Qwen2.5☆25Sep 29, 2024Updated last year
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆33Feb 12, 2025Updated last year
- a browser gui for nvidia smi☆20Mar 17, 2025Updated 11 months ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- private-machine is an AI companion system with emotion, needs and goals simulation. Very silly, not based on real science.☆28Nov 13, 2025Updated 3 months ago
- Interact privately with your documents using the power of GPT, 100% privately, no data leaks☆10May 22, 2023Updated 2 years ago
- Open-source clone of OpenAI's Deep Research. Works with any transformer, gpt4free, & runs in browser. No Firecrawl needed.☆12Jun 12, 2025Updated 8 months ago
- ☆15Apr 9, 2025Updated 10 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated last week
- ☆12Jan 20, 2026Updated 3 weeks ago
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆26Jul 26, 2025Updated 6 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆30Jun 9, 2025Updated 8 months ago
- A practical demo using Atomic Agents, showing how to build your own code generation agent that actually executes the code it writes in a …☆15Nov 24, 2024Updated last year
- An MCP server providing intelligent transcript processing capabilities, featuring natural formatting, contextual repair, and smart summar…☆18Mar 14, 2025Updated 11 months ago
- Service for testing out the new Qwen2.5 omni model☆63Apr 30, 2025Updated 9 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Oct 21, 2025Updated 3 months ago
- A Retrieval-Augmented Generation (RAG) system running DeepSeek R1 Distill LLama 70B model using Groq's fast inference API.☆13Jan 29, 2025Updated last year
- RUN LLAMA-3 70B llm with NVIDIA endpoints☆14Apr 20, 2024Updated last year
- 基于电商数据微调的Qwen2.5系列的电商大模型,电商数据sft后电商大模型。是https://github.com/leeguandong/EcommerceLLM的升级版本。qwen2.5的效果很好。☆13Oct 4, 2024Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆193Jul 21, 2024Updated last year
- Run a <400ms latency Voice Agent on just 4GB VRAM. Fully offline, no API keys required. Optimized for GTX 1650 and edge robotics with zer…☆56Updated this week
- MCP server for searching npm packages☆15Jan 7, 2026Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 9 months ago
- unsloth-5090-multiple☆60May 21, 2025Updated 8 months ago
- 🖥️ Run AI Agent in your browser.☆15,588Aug 31, 2025Updated 5 months ago
- 从零构建了Agent中最重要的功能-function call☆17Oct 16, 2024Updated last year
- ☆20Jun 28, 2025Updated 7 months ago
- A powerful AI Agent Demo playground that combines the intelligence of AI agents LLM with real-time speech-to-speech models integration. B…☆20Jun 16, 2025Updated 8 months ago
- Face Verification API☆11Sep 27, 2021Updated 4 years ago
- ☆11Updated this week
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated 11 months ago
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced …☆25Jan 19, 2025Updated last year
- 基于vllm部署qwen2.5_vl实现视频流的实时识别☆20Apr 1, 2025Updated 10 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50May 19, 2025Updated 8 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆50May 20, 2025Updated 8 months ago
- Dify 1.0 Plugin Convert your Dify tools's API to MCP compatible API☆23Apr 25, 2025Updated 9 months ago
- OpenAI-compatible TTS API that unifies multiple backends with smart chunking for unlimited-length generation☆46Dec 8, 2025Updated 2 months ago
- A tool for migrating projects with hard-coded strings to Tolgee JS☆20Nov 14, 2024Updated last year
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Sep 22, 2025Updated 4 months ago