Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs.
☆23Dec 19, 2024Updated last year
Alternatives and similar repositories for Opara
Users that are interested in Opara are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ebrowser, an energy-efficient and lightweight human interaction framework without degrading the user experience in mobile Web browsers.☆12Sep 7, 2023Updated 2 years ago
- iSpot is a lightweight and cost-effective instance provisioning framework for Directed Acyclic Graph (DAG)-style big data analytics, in …☆11Sep 7, 2023Updated 2 years ago
- ☆12Sep 20, 2023Updated 2 years ago
- DelayStage is a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so a…☆14Sep 7, 2023Updated 2 years ago
- spotDNN is a heterogeneity-aware spot instance provisioning framework to provide predictable performance for DDNN training workloads in t…☆15Sep 7, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Prophet is a predictable communication scheduling strategy to schedule the gradient transfer in an adequate order, with the aim of maximi…☆16Sep 13, 2023Updated 2 years ago
- λDNN is a cost-efficient function resource provisioning framework to minimize the monetary cost and guarantee the performance for DDNN tr…☆23Oct 25, 2023Updated 2 years ago
- Tetris, a model predictive control (MPC)-based container scheduling strategy to judiciously make migration decisions for long-running con…☆26Dec 30, 2024Updated last year
- This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Pa…☆13Apr 4, 2024Updated 2 years ago
- ☆13Feb 17, 2025Updated last year
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- ☆31Feb 8, 2026Updated 3 months ago
- ☆10Aug 22, 2023Updated 2 years ago
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆33Mar 5, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Tutorials to GPU programming. Reading notes.☆19Apr 27, 2023Updated 3 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 3 years ago
- Quantum Binary Neural Networks☆16Oct 20, 2019Updated 6 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆31Jan 3, 2026Updated 4 months ago
- Hands-on experience programming AI Engines using Vitis Unified Software Platform☆40Jul 24, 2024Updated last year
- ☆17Dec 8, 2023Updated 2 years ago
- ☆14May 30, 2023Updated 2 years ago
- Benchmark for Biophysical Sequence Optimization Algorithms☆22Apr 15, 2026Updated 3 weeks ago
- qulacs-visualizer is a quantum circuit drawing library for qulacs.☆11Aug 26, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Jul 9, 2023Updated 2 years ago
- The Atlas multi-GPU quantum circuit simulator.☆15Aug 17, 2024Updated last year
- Horizontal Fusion☆24Jan 7, 2022Updated 4 years ago
- Code for "Multi-Objective GFlowNets"☆19Jul 12, 2023Updated 2 years ago
- ☆18Jun 3, 2021Updated 4 years ago
- ChatIoT: Zero-code Generation of Trigger-action Based IoT Programs☆19Feb 16, 2025Updated last year
- ☆49Jul 13, 2024Updated last year
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- A Data Science pipeline for Algorithmic Trading: A comparative study in applications to Finance and cryptoeconomics☆14Jul 1, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆39Mar 14, 2024Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- SV-Sim: Scalable PGAS-based State Vector Simulation of Quantum Circuits☆22Feb 2, 2024Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Apr 24, 2026Updated 2 weeks ago
- ☆19Mar 4, 2025Updated last year
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Apr 13, 2026Updated 3 weeks ago
- ☆13Jun 29, 2024Updated last year