Jittor / cutt
CUDA Tensor Transpose (cuTT) library
☆10Updated 4 years ago
Alternatives and similar repositories for cutt
Users that are interested in cutt are comparing it to the libraries listed below
- ☆10Updated 5 years ago
- ☆16Updated 5 years ago
- A communication library for deep learning☆51Updated last year
- Personal collection of references for high performance mixed precision training.☆41Updated 6 years ago
- Easy Multiprocessing for Python☆42Updated 5 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago
- ☆41Updated 4 years ago
- ☆14Updated 3 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 7 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Updated 3 years ago
- Official PyTorch implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Updated 4 years ago
- Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral☆63Updated 2 years ago
- TVMScript kernel for deformable attention☆25Updated 4 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆62Updated last year
- (Batched) advanced indexing for PyTorch.☆53Updated last year
- A simple middleware to improve GPU utilization and speed up online inference.☆19Updated 4 years ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆32Updated 2 years ago
- Customized matrix multiplication kernels☆57Updated 3 years ago
- ☆33Updated last year
- ☆41Updated 4 years ago
- Profiling tools for PyTorch nn models☆42Updated 5 years ago
- AdaX: Adaptive Gradient Descent with Exponential Long Term Memory☆34Updated 5 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Updated 2 years ago
- Distributed DataLoader for PyTorch based on Ray☆25Updated 4 years ago
- An object detection codebase based on MegEngine.☆28Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- Torch Distributed Experimental☆117Updated last year
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training☆199Updated 3 years ago
- Code of "Visualizing and Understanding Object Detector"☆20Updated 4 years ago
- ICLR 2021 Stats & Graphs☆31Updated 3 years ago