GeeeekExplorer / cupytorch
A small framework that mimics PyTorch using CuPy or NumPy
☆27 · Updated 2 years ago
Related projects:
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron … ☆32 · Updated last year
- 😎 A simple and easy-to-use toolkit for GPU scheduling. ☆40 · Updated 3 years ago
- A Tight-fisted Optimizer ☆46 · Updated last year
- Codes For Sharing ☆36 · Updated 3 years ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021] ☆14 · Updated 2 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022 ☆37 · Updated last year
- Analytical solutions for logistic regression and single-layer softmax ☆12 · Updated 3 years ago
- A Transformer-based, single-model, multi-scale VAE ☆53 · Updated 3 years ago
- Distributed DataLoader For Pytorch Based On Ray ☆24 · Updated 2 years ago
- An object detection codebase based on MegEngine. ☆28 · Updated last year
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. ☆21 · Updated 2 years ago
- Lion and Adam optimization comparison ☆56 · Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719 ☆18 · Updated 3 months ago
- A simple program scheduler for your code on different devices. ☆11 · Updated last month
- Search for typos in code or text and automatically fix some of them. [The typos lib is extensible and customiz… ☆9 · Updated last year
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024). ☆18 · Updated 3 months ago
- An example of accelerating neural networks with C++ and CUDA (implements matrix addition and matrix multiplication) ☆53 · Updated 3 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling ☆78 · Updated last year
- PyTorch implementation of MLP-Mixer ☆36 · Updated 3 years ago
- An open-source project for long-tail classification ☆38 · Updated 2 years ago
- ☆13 · Updated 5 months ago
- Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang P… ☆35 · Updated last year
- Sparse Attention with Linear Units ☆17 · Updated 3 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha… ☆58 · Updated 3 years ago
- ☆28 · Updated 3 months ago
- A fully differentiable architecture search for GANs ☆17 · Updated 3 years ago
- ☆37 · Updated last year
- Various test models in WNNX format. View them with `pip install wnetron && wnetron` ☆12 · Updated 2 years ago
- Notes from my introduction to NLP at Fudan University ☆37 · Updated 3 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 · Updated last year