Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs.
☆23Dec 19, 2024Updated last year
Alternatives and similar repositories for Opara
Users that are interested in Opara are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ebrowser, an energy-efficient and lightweight human interaction framework without degrading the user experience in mobile Web browsers.☆12Sep 7, 2023Updated 2 years ago
- iSpot is a lightweight and cost-effective instance provisioning framework for Directed Acyclic Graph (DAG)-style big data analytics, in …☆11Sep 7, 2023Updated 2 years ago
- ☆12Sep 20, 2023Updated 2 years ago
- DelayStage is a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so a…☆14Sep 7, 2023Updated 2 years ago
- spotDNN is a heterogeneity-aware spot instance provisioning framework to provide predictable performance for DDNN training workloads in t…☆15Sep 7, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Prophet is a predictable communication scheduling strategy to schedule the gradient transfer in an adequate order, with the aim of maximi…☆16Sep 13, 2023Updated 2 years ago
- Reading paper list for iCloud group☆14May 3, 2026Updated last month
- Tetris, a model predictive control (MPC)-based container scheduling strategy to judiciously make migration decisions for long-running con…☆26Dec 30, 2024Updated last year
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.☆39Jun 11, 2024Updated 2 years ago
- This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Pa…☆13Apr 4, 2024Updated 2 years ago
- ☆13Feb 17, 2025Updated last year
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- ☆32Feb 8, 2026Updated 4 months ago
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆33Mar 5, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Tutorials to GPU programming. Reading notes.☆19Apr 27, 2023Updated 3 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 3 years ago
- Hands-on experience programming AI Engines using Vitis Unified Software Platform☆41Jul 24, 2024Updated last year
- ☆17Dec 8, 2023Updated 2 years ago
- ☆10Oct 5, 2023Updated 2 years ago
- ☆14May 30, 2023Updated 3 years ago
- Benchmark for Biophysical Sequence Optimization Algorithms☆22Apr 15, 2026Updated 2 months ago
- qulacs-visualizer is a quantum circuit drawing library for qulacs.☆11Aug 26, 2025Updated 9 months ago
- Official implementation of NeurIPS'23 paper "Sample-efficient Multi-objective Molecular Optimization with GFlowNets"☆20Dec 24, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for "Multi-Objective GFlowNets"☆20Jul 12, 2023Updated 2 years ago
- ☆26Oct 1, 2025Updated 8 months ago
- a Log-Structured Merged-Tree store engine☆16Sep 21, 2023Updated 2 years ago
- A Project dedicated to making GPU Partitioning on Windows easier!☆15Jan 10, 2022Updated 4 years ago
- ☆19Jun 3, 2021Updated 5 years ago
- ChatIoT: Zero-code Generation of Trigger-action Based IoT Programs☆19Feb 16, 2025Updated last year
- ☆49Jul 13, 2024Updated last year
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆19Mar 4, 2025Updated last year
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Apr 13, 2026Updated 2 months ago
- ☆13Jun 29, 2024Updated last year
- ☆20Feb 5, 2022Updated 4 years ago
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆11Sep 18, 2024Updated last year
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14May 20, 2026Updated 3 weeks ago
- Eye-MMS: Miniature multi-scale segmentation network of key eye-regions in embedded applications☆12Jul 4, 2022Updated 3 years ago