Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs.
☆23Dec 19, 2024Updated last year
Alternatives and similar repositories for Opara
Users that are interested in Opara are comparing it to the libraries listed below
Sorting:
- Prophet is a predictable communication scheduling strategy to schedule the gradient transfer in an adequate order, with the aim of maximi…☆16Sep 13, 2023Updated 2 years ago
- λDNN is a cost-efficient function resource provisioning framework to minimize the monetary cost and guarantee the performance for DDNN tr…☆23Oct 25, 2023Updated 2 years ago
- Tetris, a model predictive control (MPC)-based container scheduling strategy to judiciously make migration decisions for long-running con…☆25Dec 30, 2024Updated last year
- This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Pa…☆13Apr 4, 2024Updated last year
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆33Updated this week
- Horizontal Fusion☆24Jan 7, 2022Updated 4 years ago
- ☆38Mar 14, 2024Updated last year
- Continuous Pipelined Speculative Decoding☆16Jan 4, 2026Updated 2 months ago
- ☆48Jul 13, 2024Updated last year
- Hands-on experience programming AI Engines using Vitis Unified Software Platform☆40Jul 24, 2024Updated last year
- Quantum Binary Neural Networks☆15Oct 20, 2019Updated 6 years ago
- Rust CLI tool for syncing Claude Code conversation history across machines using git repositories.☆22Feb 27, 2026Updated last week
- ☆10Aug 22, 2023Updated 2 years ago
- This is the official code for CoRL 2022 "Robustness Certification of Visual Perception Models via Camera Motion Smoothing"☆11Apr 5, 2023Updated 2 years ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- ☆12Jan 18, 2026Updated last month
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- survey and analysis of kv-stores in academia and industry☆10Aug 31, 2019Updated 6 years ago
- A tensorflow implementation of YOLOv4. CSPDarknet53 PAN SPP CIoU Mish,☆13Sep 11, 2020Updated 5 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 2 years ago
- Evaluate Transformers from the Hub 🔥☆14Nov 27, 2023Updated 2 years ago
- ☆12Jun 29, 2024Updated last year
- ☆12Jul 18, 2024Updated last year
- Eye-MMS: Miniature multi-scale segmentation network of key eye-regions in embedded applications☆12Jul 4, 2022Updated 3 years ago
- A dataset of egocentric vision, eye-tracking and full body kinematics from human locomotion in out-of-the-lab environments. Also, differe…☆12Nov 5, 2023Updated 2 years ago
- ☆16Updated this week
- Lightweight views for your models☆16Dec 12, 2023Updated 2 years ago
- Deft: A Scalable Tree Index for Disaggregated Memory☆23Apr 23, 2025Updated 10 months ago
- ☆12Sep 16, 2025Updated 5 months ago
- Python implementation of a Genetic Algorithm for the Resource-Constrained Project Scheduling Problem☆14May 29, 2023Updated 2 years ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22)☆13Dec 15, 2022Updated 3 years ago
- ☆14May 23, 2023Updated 2 years ago
- A Data Science pipeline for Algorithmic Trading: A comparative study in applications to Finance and cryptoeconomics☆14Jul 1, 2022Updated 3 years ago
- ☆18Mar 4, 2025Updated last year
- ☆13Mar 12, 2025Updated 11 months ago
- Official code of MoSA (Mixture of Sparse Adapters).☆13Dec 14, 2023Updated 2 years ago
- ☆12Mar 18, 2024Updated last year
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆12Sep 18, 2024Updated last year
- ☆16Sep 27, 2023Updated 2 years ago