wqzustc / High-Performance-Tensor-Processing-EnginesLinks
Some Hardware Architectures for GEMM
☆288Updated 8 months ago
Alternatives and similar repositories for High-Performance-Tensor-Processing-Engines
Users that are interested in High-Performance-Tensor-Processing-Engines are comparing it to the libraries listed below
Sorting:
- ☆140Updated 6 months ago
- Official implementation of "REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving" (NeurIPS 2025)☆99Updated 2 months ago
- ☆24Updated last year
- LLM Serving simulation for multi-core NPU☆113Updated last month
- YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA,…☆36Updated last week
- Host shell scripts: configure FPGA's DMA-SG via PCIe XDMA.☆26Updated 7 months ago
- Vitis HLS 2022.2 projects source code: C design, C simulation, RTL simulation.【vitis_hls工程】☆23Updated 7 months ago
- Step-by-step optimization of TPU MatMul Kernels☆85Updated 6 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆127Updated 3 months ago
- CXLMemSim: A pure software simulated CXL.mem for performance characterization☆599Updated this week
- CXL remote offloading data movement aware compiler☆72Updated last month
- Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate (NeurIPS 2024)☆33Updated last year
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- ☆260Updated this week
- 没分支的 rCore-Tutorial☆30Updated last year
- [NeurIPS 2025] Accelerating Parallel Diffusion Model Serving with Residual Compression☆40Updated 3 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆574Updated last month
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆52Updated last year
- ☆48Updated 6 months ago
- NeuroMinecraftGenesis (NMG): A revolutionary AI system that draws on the DiscoRL self-evolving algorithm, six-dimensional cognitive engin…☆88Updated last month
- ☆82Updated 7 months ago
- ☆172Updated 2 weeks ago
- This is a deep learning project applied to signal integrity and RF analysis. Automated modeling, simulation, and data storage of HFSS for…☆73Updated last month
- A manifesto and playbook for AI-native software engineering in the LLM era / AI-Native的软件工程宣言☆291Updated 2 months ago
- Virtual to Real, Synthetic Data, Vehicle Re-identification☆104Updated last year
- High Performance Distributed Database with MySQL Compatible API, Great Scalability, Full ACID Distributed Transactions, and Tiered S3 Sto…☆475Updated this week
- LIMU-BERT-X is a sensor foundation model pretrained with 1.43 millions hours of IMU data☆57Updated last month
- Your personal AI research assistant that serves you a daily dose of arXiv papers, tailored to your interests from Zotero or the pdfs from…☆141Updated 6 months ago
- The Python implementation of some deep text hashing (also called deep semantic hashing) Models☆80Updated 2 months ago
- [Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,174Updated 3 months ago