GeeeekExplorer / cupytorch
A small framework mimics PyTorch using CuPy or NumPy
☆27Updated 2 years ago
Alternatives and similar repositories for cupytorch:
Users that are interested in cupytorch are comparing it to the libraries listed below
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Updated last year
- differentiable top-k operator☆21Updated 3 weeks ago
- 逻辑回归和单层softmax的解析解☆12Updated 3 years ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]☆14Updated 3 years ago
- A Tight-fisted Optimizer☆47Updated last year
- # Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang P…☆34Updated last year
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022☆36Updated 2 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…☆48Updated 3 years ago
- Lion and Adam optimization comparison☆56Updated last year
- Search for typos in code or text, automatically fix some typos. 查找文本或代码中的拼写错误/打字错误,自动修改部分 typos。【The typos lib is extensible and customiz…☆9Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 2 years ago
- ☆16Updated 3 years ago
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- ☆22Updated last year
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆42Updated 3 years ago
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- paddle code convert toolkit☆22Updated last year
- ICLR 2021 Stats & Graphs☆31Updated 2 years ago
- ☆13Updated last year
- Calculating FLOPs of Pre-trained Models in NLP☆18Updated 3 years ago
- Finetune CPM-1☆24Updated 3 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 2 years ago
- A simple program scheduler for your code on different devices.☆11Updated 5 months ago
- ☆18Updated 8 months ago
- 基于Transformer的单模型、多尺度的VAE模型☆55Updated 3 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- ☆13Updated 2 years ago
- A pre-trained model with multi-exit transformer architecture.☆53Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆36Updated 3 years ago