GeeeekExplorer / cupytorch
A small framework mimics PyTorch using CuPy or NumPy
☆27Updated 3 years ago
Alternatives and similar repositories for cupytorch:
Users that are interested in cupytorch are comparing it to the libraries listed below
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Updated last year
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 2 years ago
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- differentiable top-k operator☆21Updated 3 months ago
- ☆37Updated last year
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 2 years ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]☆14Updated 3 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- 逻辑回归和单层softmax的解析解☆12Updated 3 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆55Updated 3 years ago
- A simple program scheduler for your code on different devices.☆11Updated 7 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- ICLR 2021 Stats & Graphs☆31Updated 2 years ago
- paddle code convert toolkit☆22Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆38Updated 4 years ago
- A Tight-fisted Optimizer☆47Updated 2 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Updated 9 months ago
- ☆14Updated 2 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆103Updated 2 years ago
- ☆22Updated last year
- An open-source project for long-tail classification☆39Updated 3 years ago
- Easy Multiprocessing for Python☆43Updated 4 years ago
- Codes For Sharing☆38Updated 4 years ago
- ☆100Updated 3 years ago
- ☆16Updated 3 years ago
- Fork of diux-dev/imagenet18☆15Updated 6 years ago
- Python下shuffle几百G文件☆33Updated 3 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆82Updated 2 years ago
- A simple middleware to improving GPU utilization then speedup online inference.☆19Updated 4 years ago