openvino-dev-samples / Qwen2.openvino
This sample shows how to deploy Qwen2 using OpenVINO
☆39 Updated 10 months ago
Alternatives and similar repositories for Qwen2.openvino
Users who are interested in Qwen2.openvino are comparing it to the libraries listed below.
- run chatglm3-6b in BM1684X ☆40 Updated last year
- run ChatGLM2-6B in BM1684X ☆50 Updated last year
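- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat ☆82 Updated last year
- qwen2 and llama3 cpp implementation ☆46 Updated last year
- Research on accelerating production deployment of the GOT-OCR project, not limited to any language ☆61 Updated 9 months ago
- unify-easy-llm (ULM) aims to be a simple one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models. ☆57 Updated last year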
- GLM Series Edge Models ☆147 Updated 2 months ago
- This is Microsoft-Phi-3-NvidiaNIMWorkshop ☆23 Updated 11 months ago
- qwen models finetuning ☆103 Updated 5 months ago
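- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into langchain to load a local knowledge base for retrieval-augmented generation (RAG). ☆563 Updated last year
- Deploy your own OpenAI API 🤩, based on Flask and transformers (uses the Baichuan2-13B-Chat-4bits model and can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models and Completions endpoints, including streaming resp… ☆95 Updated last year
- Alpaca Chinese Dataset -- a Chinese instruction fine-tuning dataset ☆213 Updated 10 months ago
- Repository of Phi3 Chinese post-trained models ☆321 Updated 8 months ago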
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆38 Updated 8 months ago
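- Deploy Qwen2.5 using FastAPI + vLLM ☆22 Updated 10 months ago
- vLLM Documentation in Simplified Chinese ☆92 Updated 3 months ago
- Splices the SmolVLM2 vision head onto the Qwen3-0.6B model and fine-tunes the combination ☆226 Updated 2 weeks ago
- Train a mini Chinese large language model from scratch that can hold basic conversations; the model size depends on the hardware you have on hand ☆61 Updated last year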
- llm-export can export llm model to onnx. ☆302 Updated 6 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆28 Updated last year
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smartphone games in China ☆66 Updated 11 months ago
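- Python3 package for Chinese/English OCR, using the paddleocr-v5 onnx model (~20MB), with ultra-fast inference speed. Inference based on the ppocr-v5-onnx model; open-source Chinese/English OCR… ☆96 Updated 3 weeks ago
- A tutorial demonstrating Chinese instruction fine-tuning of Gemma ☆46 Updated last year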
- Qwen-Efficient-Tuning ☆44 Updated last year
- Explore LLM model deployment based on AXera's AI chips ☆109 Updated last week
- Port of Facebook's LLaMA model in C/C++ ☆98 Updated this week
- Efficient inference of large language models. ☆150 Updated last month
- OrionStar-Yi-34B-Chat is an open-source Chinese/English chat model, fine-tuned by OrionStar from the open-source Yi-34B model on 150K+ high-quality corpus samples. ☆261 Updated last year
- ☆350 Updated last year
- Music large model based on InternLM2-chat. ☆22 Updated 7 months ago