☆33Jun 6, 2023Updated 2 years ago
Alternatives and similar repositories for hotline
Users who are interested in hotline are comparing it to the libraries listed below.
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆14Dec 16, 2024Updated last year
- ☆13Mar 6, 2023Updated 3 years ago
- benchmark models for TNN, ncnn, MNN☆20Jun 10, 2020Updated 5 years ago
- Utilities for paper writing.☆12Jan 11, 2026Updated 2 months ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 2 years ago
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- ☆25Mar 15, 2023Updated 3 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆47Apr 7, 2021Updated 4 years ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Jan 21, 2025Updated last year
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated 10 months ago
- a data collection of related work: Toward Understanding Deep Learning Framework Bugs☆17Oct 23, 2023Updated 2 years ago
- ☆23Apr 25, 2023Updated 2 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- Renee: End-to-end training of extreme classification models☆23Sep 29, 2023Updated 2 years ago
- ☆36Mar 12, 2026Updated 2 weeks ago
- AnyDSL traversal code☆15Feb 18, 2019Updated 7 years ago
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆160Apr 20, 2024Updated last year
- An experimental ahead of time compiler for Relay.☆49Apr 21, 2020Updated 5 years ago
- Distributed machine learning platform☆13Aug 20, 2015Updated 10 years ago
- Thunder Research Group's Collective Communication Library☆50Jul 8, 2025Updated 8 months ago
- [MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models☆16May 5, 2023Updated 2 years ago
- ☆13Nov 15, 2022Updated 3 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆22Feb 5, 2026Updated last month
- An easily extensible framework for understanding and optimizing CUDA operators, intended for learning purposes only.☆18Jun 13, 2024Updated last year
- Complete solution to enable RDMA (on both InfiniBand and RoCE) and accelerate TCP to bare metal performance on Kubernetes☆11Aug 1, 2018Updated 7 years ago
- A light-weight neural network optimizer for different software/hardware backends.☆20Nov 23, 2020Updated 5 years ago
- ☆14Mar 8, 2025Updated last year
- A PyTorch implementation of "Self-Supervised GNN that Jointly Learns to Augment" or "Jointly Learnable Data Augmentations for Self-Superv…☆13Dec 13, 2021Updated 4 years ago
- ☆18Oct 15, 2020Updated 5 years ago
- Packed Memory Array☆17May 14, 2014Updated 11 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.☆20Aug 11, 2025Updated 7 months ago
- An n-ary relational dataset derived from Wikidata☆15Mar 23, 2021Updated 5 years ago
- Differentiable Combinatorial Scheduling at Scale (ICML'24). Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu.☆22Oct 31, 2024Updated last year
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".☆92May 23, 2023Updated 2 years ago
- ☆20Dec 27, 2021Updated 4 years ago
- This is the project repository of our ESEC/FSE 2021 paper: A Comprehensive Study of Deep Learning Compiler Bugs.☆23Aug 15, 2023Updated 2 years ago
- An implementation of the FP-Growth algorithm in pure Rust.☆16Apr 20, 2021Updated 4 years ago
- A GPU-driven system framework for scalable AI applications☆125Feb 5, 2025Updated last year
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Mar 23, 2025Updated last year