HimariO / llama.cpp.qwen2.5vl
Port of Facebook's LLaMA model in C/C++
☆46Updated last week
Alternatives and similar repositories for llama.cpp.qwen2.5vl:
Users that are interested in llama.cpp.qwen2.5vl are comparing it to the libraries listed below
- Port of Facebook's LLaMA model in C/C++☆92Updated this week
- 研究GOT-OCR-项目落地加速,不限语言☆60Updated 6 months ago
- Explore LLM model deployment based on AXera's AI chips☆100Updated this week
- LM inference server implementation based on *.cpp.☆169Updated this week
- xllamacpp - a Python wrapper of llama.cpp☆35Updated last week
- GLM Series Edge Models☆136Updated 2 months ago
- qwen2 and llama3 cpp implementation☆44Updated 10 months ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- ☆225Updated 2 months ago
- llm deploy project based onnx.☆36Updated 6 months ago
- run chatglm3-6b in BM1684X☆38Updated last year
- ☆41Updated 5 months ago
- Inference deployment of the llama3☆11Updated last year
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- Service for testing out the new Qwen2.5 omni model☆35Updated 3 weeks ago
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆185Updated this week
- Mixture-of-Experts (MoE) Language Model☆186Updated 7 months ago
- automatically quant GGUF models☆168Updated this week
- A pipeline parallel training script for LLMs.☆137Updated 3 weeks ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated last week
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆101Updated last week
- ☆29Updated last year
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆76Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆50Updated this week
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆43Updated last week
- SealAI's stable diffusion implementation☆76Updated 4 months ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- ggml学习笔记,ggml是一个机器学习的推理框架☆15Updated last year
- Run generative AI models in sophgo BM1684X☆199Updated this week
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆17Updated 7 months ago