openvino-dev-samples / Qwen2.openvino
This sample shows how to deploy Qwen2 using OpenVINO.
☆35 Updated 4 months ago
Alternatives and similar repositories for Qwen2.openvino:
Users who are interested in Qwen2.openvino are comparing it to the libraries listed below:
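- Research on accelerating GOT-OCR for production deployment, in any language ☆57 Updated 3 months ago
- unify-easy-llm (ULM) aims to be a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models. ☆54 Updated 6 months ago
- Run ChatGLM2-6B on BM1684X ☆49 Updated 10 months ago
- The official account for everyone, "查特查特", is now live! New problems, new methods, new discoveries; PRs welcome! ☆41 Updated last year
- Focused on technical sharing in the field of dialogue systems, with an emphasis on the column "Dify Application Operations and Source Code Analysis". ☆71 Updated 6 months ago
- GLM Series Edge Models ☆127 Updated 3 weeks ago
- This project aims to give beginners in the large-model field a comprehensive body of knowledge, covering both basic and advanced content, so that developers can quickly master the large-model technology stack and the knowledge around it. ☆44 Updated 3 weeks ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data. ☆44 Updated 4 months ago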
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆35 Updated last month
- ☆201 Updated last month
- ☆25 Updated 3 months ago
- Pseudo Streaming SenseVoice with Hotwords ☆171 Updated last month
- Port of Facebook's LLaMA model in C/C++ ☆80 Updated last week
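- Applications of large language models and multimodal models, mainly covering RAG, small models, agents, cross-modal search, OCR, and more ☆147 Updated 2 months ago
- To try textin document parsing, visit https://cc.co/16YSIy ☆72 Updated 2 months ago
- A Langchain-based plugin for quickly integrating GLM-4 AllTools functionality ☆46 Updated 6 months ago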
- ☆58 Updated 3 months ago
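- A pure C++ cross-platform LLM acceleration library that can be called from Python; supports the baichuan, glm, llama, and moss base models; runs chatglm-6B-class models smoothly on mobile devices and reaches 10000+ tokens/s on a single GPU ☆45 Updated last year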
- vLLM Documentation in Chinese Simplified / vLLM 中文文档 ☆25 Updated 3 weeks ago
- Deploy your own OpenAI API 🤩, based on flask and transformers (uses the Baichuan2-13B-Chat-4bits model and can run on a single Tesla T4 GPU); implements the OpenAI Chat, Models, and Completions interfaces, including streaming… ☆88 Updated last year
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi… ☆187 Updated 3 months ago
- share data, prompt data, pretraining data ☆35 Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆220 Updated this week
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆25 Updated 8 months ago
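- Demonstrates the impressive effect of vllm on Chinese large language models ☆31 Updated last year
- Legal-Eagle-InternLM is a legal question-answering bot based on InternLM (书生浦语), the large model released by SenseTime and Shanghai AI Laboratory. It aims to be a legal-domain large model that provides users with professional, intelligent, and comprehensive legal services in line with the 3H principles (Helpful, Honest, Harmless). ☆51 Updated 11 months ago
- Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers) / LoRA (peft) / inference ☆50 Updated 8 months ago
- For training a Chinese mini large language model from scratch that can hold basic conversations; the model size depends on the machine you have available ☆56 Updated 5 months ago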
- qwen-7b and qwen-14b finetuning ☆88 Updated 9 months ago
- qwen2 and llama3 cpp implementation ☆39 Updated 7 months ago