openvino-dev-samples / Qwen2.openvino
This sample shows how to deploy Qwen2 using OpenVINO
☆38Updated 7 months ago
Alternatives and similar repositories for Qwen2.openvino
Users that are interested in Qwen2.openvino are comparing it to the libraries listed below
Sorting:
- run ChatGLM2-6B in BM1684X☆49Updated last year
- Music large model based on InternLM2-chat.☆22Updated 4 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆60Updated 6 months ago
- run chatglm3-6b in BM1684X☆38Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆55Updated 9 months ago
- ☆44Updated 6 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 5 months ago
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆66Updated 8 months ago
- A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limi…☆27Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆47Updated last year
- Explore LLM model deployment based on AXera's AI chips☆103Updated this week
- qwen2 and llama3 cpp implementation☆44Updated 11 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆56Updated 8 months ago
- ☆40Updated 2 months ago
- A knowledge base backend system for LLMs with full-text search, semantic retrieval, and knowledge graph querying. Ready-to-use modules fo…☆27Updated last month
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆93Updated last year
- 本项目致力于为大模型领域的初学者提供全面的知识体系,包括基础和高阶内容,以便开发者能迅速掌握大模型技术栈并全面了解相关知识。☆58Updated 4 months ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆15Updated last year
- GLM Series Edge Models