LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks
☆18Mar 25, 2022Updated 4 years ago
Alternatives and similar repositories for LaLaRAND
Users that are interested in LaLaRAND are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27Aug 31, 2023Updated 2 years ago
- Labs of 2019 Web Information Processing and Application in USTC.☆11Jan 15, 2020Updated 6 years ago
- ☆13Jan 28, 2026Updated 3 months ago
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- Pie: Programmable LLM Serving☆152Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 2 years ago
- Luthier, a GPU binary instrumentation tool for AMD GPUs☆28Updated this week
- HTML/JS port of CUDA Occupancy Calculator☆17Nov 23, 2021Updated 4 years ago
- The artifact for NDSS '25 paper "ASGARD: Protecting On-Device Deep Neural Networks with Virtualization-Based Trusted Execution Environmen…☆15Oct 16, 2025Updated 6 months ago
- ☆78May 28, 2023Updated 2 years ago
- ☆10Dec 26, 2023Updated 2 years ago
- ☆24Sep 11, 2025Updated 7 months ago
- ☆17May 14, 2020Updated 5 years ago
- A deep learning solvation model☆13Aug 24, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- BadgerTrap is a tool to instrument x86-64 TLB misses.☆13Nov 13, 2016Updated 9 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- ☆19Feb 28, 2022Updated 4 years ago
- (elastic) cuckoo hashing☆17Jun 20, 2020Updated 5 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆33Feb 10, 2025Updated last year
- ☆24Jun 24, 2020Updated 5 years ago
- It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …☆12Jun 3, 2018Updated 7 years ago
- ☆14Oct 30, 2024Updated last year
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆15Dec 9, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- icml24☆14Feb 24, 2025Updated last year
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- 健康学习到150岁 - 人体系统调优不完全指南☆14May 30, 2022Updated 3 years ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"☆15Jul 4, 2023Updated 2 years ago
- ☆19Oct 24, 2024Updated last year
- Fork of gem5 with support for manycore architectures. Includes models and scripts to evaluate a software-defined-vector architecture.☆13Oct 14, 2021Updated 4 years ago
- YOLOv8 C++ DET、SEG、POSE TENSORRT 推理库,便于学习开发拓展与工作中实际部署☆18Aug 22, 2023Updated 2 years ago
- Repository for OpenCL codes.☆12Jul 30, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Aug 6, 2025Updated 9 months ago
- Jetson embedded platform-target deep learning inference acceleration framework with TensorRT☆30Oct 10, 2025Updated 6 months ago
- 中 国科学技术大学计算机学院课程资源(https://mbinary.xyz/ustc-cs/)☆19Feb 27, 2019Updated 7 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Sep 21, 2023Updated 2 years ago
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆20Dec 6, 2024Updated last year
- ☆14Feb 5, 2025Updated last year
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper☆17Nov 6, 2025Updated 6 months ago