CadenCao / vllm-qwen1.5-StreamChat
Deploy Qwen1.5 with the vLLM framework and stream its output
☆89 Updated last year
Alternatives and similar repositories for vllm-qwen1.5-StreamChat
Users that are interested in vllm-qwen1.5-StreamChat are comparing it to the libraries listed below
- Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory. ☆67 Updated last year
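- This project shows how to use GPT to automatically retrieve files in a repository (PDF, XLS, Word, etc.) and complete multimodal tasks. Video frames from a home camera can be fed into the repository to automatically judge whether something dangerous is happening at home, drawing on the large model's understanding of the world. ☆91 Updated 6 months ago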
- This project builds on representative work by earlier researchers to evaluate SFT data along multiple dimensions and filter it automatically. ☆44 Updated last year
- [NeurIPS 2025] DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding ☆71 Updated this week
- This project uses YOLOv8 combined with ByteTrack to achieve multi-target tracking. ☆65 Updated 11 months ago
- Building the first open-source version of KimiChat! ☆107 Updated 4 months ago
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM ☆57 Updated 3 weeks ago
- Fuzzy matching library for Chinese/English text, by pinyin or character ☆37 Updated 2 months ago
- ☆101 Updated last year
- ☆164 Updated last week
- Joint entity and relation extraction ☆187 Updated last year
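- An intelligent academic writing system built on multiple large language models. Given a thesis proposal or research design document, it automatically generates the chapters of a sample academic paper, complete with citations. ☆220 Updated 2 months ago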
- (NeurIPS'24) LLM4EA: <Entity Alignment with Noisy Annotations from Large Language Models> ☆56 Updated last month
- Lazada Tiktok Shopee SHEIN Shop Open Platform Api (Easy Cross Border) ☆93 Updated last week
- ☆143 Updated last year
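- A record of my programming learning and growth journey, covering technical notes, project practice, reflections, and everyday thoughts. ☆85 Updated this week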
- A machine learning-driven solution designed to detect fraudulent activities in bank payment systems ☆130 Updated 9 months ago
- [CVPR 2025] MDP: Multidimensional Vision Model Pruning with Latency Constraint ☆107 Updated last week
- Zero Graph – Minimalist LLM framework designed for AI Agent programming ☆104 Updated 2 months ago
- 2025 tech sharing (FullStack Frontend Focus), covering commonly used knowledge points. All code is typed by hand and verified with AI; quality content only!!! ☆154 Updated 2 months ago
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour… ☆148 Updated 2 years ago
- A high-performance Swift wrapper for MaxMind's GeoIP2 databases, offering thread-safe IP geolocation lookups with optimized memory manage… ☆101 Updated 4 months ago
- This project uses the YOLOv4 model and applies OpenCV algorithms to recognize the digits shown on digital signal lights. ☆99 Updated 2 years ago
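- A minimal, efficient LLM agent development framework that is easy to integrate and flexible to extend, with strong context management, and suitable for beginners. ☆99 Updated 2 months ago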
- Inscriptions on CoreDao, powered by Insdexer. ☆149 Updated last year
- GAL-DAWN: A Novel High-Performance Computing Library of Graph Algorithms based on DAWN, CUDA/C++ ☆86 Updated 5 months ago
- ☆143 Updated last year
- Real-time speech translation tool ☆64 Updated 4 months ago
- Python API for triggering TeamCity by REST API ☆59 Updated 5 months ago
- A toolkit that helps you automatically delete old Docker images from an AWS ECR repository, keeping only the latest N images. ☆52 Updated 7 months ago