A high-throughput and memory-efficient inference and serving engine for LLMs
☆77Oct 28, 2024Updated last year
Alternatives and similar repositories for vllm_musa
Users that are interested in vllm_musa are comparing it to the libraries listed below
Sorting:
- MUSA Templates for Linear Algebra Subroutines☆42Jan 30, 2026Updated last month
- Solutions to AoC 2022 in zig☆11May 6, 2023Updated 2 years ago
- a static analytical model for LLM distributed training☆119Jan 8, 2026Updated last month
- ☆62Feb 4, 2026Updated last month
- A Winograd Minimal Filter Implementation in CUDA☆28Aug 25, 2021Updated 4 years ago
- ☆24Mar 15, 2023Updated 2 years ago
- An IR for efficiently simulating distributed ML computation.☆32Jan 13, 2024Updated 2 years ago
- Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.☆73Sep 8, 2024Updated last year
- ☆32Aug 24, 2022Updated 3 years ago
- ☆155Mar 4, 2025Updated last year
- DeepSeek-V3/R1 inference performance simulator☆179Mar 27, 2025Updated 11 months ago
- An object detection model for NMNIST larger video frame☆12Feb 24, 2022Updated 4 years ago
- Decentralized, transparent, verifiable and anonymous voting app☆12Feb 20, 2023Updated 3 years ago
- Directed masked autoencoders☆14Feb 20, 2026Updated 2 weeks ago
- [Qt5开发及实例(第3版)][陆文周][程序源代码]☆10May 23, 2018Updated 7 years ago
- ☆12Mar 31, 2021Updated 4 years ago
- 2024维护(复刻)版本的yolov5+deepsort目标检测和追踪,能显示目标类别,能训练自己数据集.包含了一部分测试视频供常识,提供了txt和json两种格式的识别输出方式.可用于识别项目,路面识别,智能交通,毕设等各种.☆10Feb 28, 2024Updated 2 years ago
- Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)☆19Jan 3, 2026Updated 2 months ago
- ☆20May 24, 2025Updated 9 months ago
- A well-tested TOML parsing library for Zig☆43Sep 28, 2024Updated last year
- ☆11Aug 23, 2015Updated 10 years ago
- Work in progress rust bindings to ggml☆12May 1, 2023Updated 2 years ago
- YoloV6 for a bare Raspberry Pi using ncnn.☆11Jun 12, 2024Updated last year
- YOLOv5s inference In C# and Training In Python☆10May 30, 2022Updated 3 years ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- A tracking scheme developed by integrating six tracking methods, DeepSORT StrongSORT OSNet HybridSORT, OCSORT, and ByteTrack, using yolov…☆12Feb 22, 2024Updated 2 years ago
- ☆12Mar 13, 2023Updated 2 years ago
- a simple API to use CUPTI☆11Aug 19, 2025Updated 6 months ago
- Github老玩家自己搭的服务器,老飞飞原版,可联机-天马座☆11May 14, 2019Updated 6 years ago
- Command-Line Argument Parser for C++20☆23Jan 1, 2026Updated 2 months ago
- ☆13Jan 7, 2025Updated last year
- ☆12Mar 24, 2021Updated 4 years ago
- linear algebra package. like gonum/mat, but small. lets say gonum-lite☆12Jul 8, 2023Updated 2 years ago
- YOLOv1 implementation using PyTorch☆11Jan 18, 2023Updated 3 years ago
- A low-resource native app for sharing space with co-workers and friends.☆15Feb 20, 2025Updated last year
- A FantasyConsole compiled as WebAssembly and written in Zig☆14Feb 13, 2023Updated 3 years ago
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Mar 23, 2025Updated 11 months ago
- Lightweight behavior tree implementation in Rust☆11Jan 4, 2026Updated 2 months ago