Memory footprint reduction for transformer models
☆11Jan 24, 2023Updated 3 years ago
Alternatives and similar repositories for Tempo
Users who are interested in Tempo are comparing it to the libraries listed below.
- ☆12May 3, 2020Updated 6 years ago
- ⚡ Bring some magic to i.sjtu.edu.cn☆22Jan 3, 2020Updated 6 years ago
- A custom AI chip to be taped out soon!☆46Dec 20, 2025Updated 4 months ago
- ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback☆19Dec 3, 2024Updated last year
- ☆24Feb 1, 2025Updated last year
- GPU-accelerated AES encryption project☆11Feb 13, 2015Updated 11 years ago
- A deep learning intermediate representation for multi-platform compilation optimization☆10Oct 28, 2024Updated last year
- Implementation of algorithms for memory optimized deep neural network training☆10Jul 23, 2020Updated 5 years ago
- Python C++ Code Manager☆15Sep 29, 2024Updated last year
- Auto-differentiation library for C++☆12Jan 16, 2022Updated 4 years ago
- Making an Ace Combat-Style flight shooting game (only one mission) with Unity.☆12Jan 31, 2024Updated 2 years ago
- Scalable radix top-k selection on GPUs.☆23Jan 27, 2025Updated last year
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation☆19Mar 6, 2025Updated last year
- [ACL 2026 🔥] CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark☆34Apr 20, 2026Updated 2 weeks ago
- Distribution System Simulator based on OpenDSS and OpenDSSDirect.py. Modern Syntax, DataFrames, Pint, Networkx, Algorithmic Agents.☆14Jan 21, 2022Updated 4 years ago
- AutodiffEngine☆13Apr 1, 2019Updated 7 years ago
- Implementation of vDNN++; an improvement over vDNN☆18Dec 7, 2018Updated 7 years ago
- simple solution based on Gradient Boost and Random Forest, rank 24/3251 (top 1%) within 60 lines of python code☆14Jun 21, 2019Updated 6 years ago
- ☆16Jul 29, 2025Updated 9 months ago
- antkillerfarm's crazy magic☆17Oct 3, 2024Updated last year
- ☆30Aug 4, 2025Updated 9 months ago
- Data pre-processing and training code on Open-X-Embodiment with pytorch☆11Jan 20, 2025Updated last year
- Thinking is hard - automate it☆18Aug 24, 2022Updated 3 years ago
- Lecture notes at SJTU☆40Feb 12, 2021Updated 5 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated 3 weeks ago
- An easily extensible framework for understanding and optimizing CUDA operators, intended for learning purposes only☆18Jun 13, 2024Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆19Mar 7, 2025Updated last year
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- ByteCheckpoint: An Unified Checkpointing Library for LFMs☆278Feb 2, 2026Updated 3 months ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated last year
- DL Dataloader Benchmarks☆20Jan 27, 2025Updated last year
- DietCode Code Release☆65Jul 21, 2022Updated 3 years ago
- The only known (by 2022) open-source, easy-to-understand basic algorithm implementations in TD-CEM. (Please star and fork this project if…☆15Mar 1, 2022Updated 4 years ago
- A single header-only C++ library for automatic / algorithmic differentiation.☆16Nov 29, 2022Updated 3 years ago
- ☆11May 12, 2017Updated 8 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆14Dec 16, 2024Updated last year
- Differentiable Combinatorial Scheduling at Scale (ICML'24). Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu.☆22Oct 31, 2024Updated last year
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Dec 8, 2020Updated 5 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago