MLPruning, PyTorch, NLP, BERT, Structured Pruning
☆20Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for MLPruning
Users that are interested in MLPruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Oct 15, 2020Updated 5 years ago
- ☆34Updated this week
- ☆43Jan 30, 2024Updated 2 years ago
- ☆17Apr 1, 2020Updated 6 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.☆23Aug 21, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Jax implementation of the AdaHessian optimizer☆19Mar 11, 2021Updated 5 years ago
- 机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格…☆10Feb 25, 2020Updated 6 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- [CVPR'19] Trust Region Based Adversarial Attack☆20Dec 11, 2020Updated 5 years ago
- Prune a model while finetuning or training.☆407Jun 21, 2022Updated 3 years ago
- [ WSDM '22 ] On Sampling Collaborative Filtering Datasets☆20Jan 13, 2022Updated 4 years ago
- NAACL 2022: Can Rationalization Improve Robustness? https://arxiv.org/abs/2204.11790☆27Nov 21, 2022Updated 3 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- OpenPAI SDK☆19Dec 10, 2022Updated 3 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆52Mar 1, 2018Updated 8 years ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated last year
- ☆23Apr 5, 2023Updated 3 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 4 years ago
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆22Jan 13, 2023Updated 3 years ago
- pyhessian is a TensorFlow module which can be used to estimate Hessian matrices☆25Mar 26, 2021Updated 5 years ago
- **ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy…☆12Mar 26, 2023Updated 3 years ago
- BERT系列模型、搜搜、剪枝、蒸馏☆13Sep 10, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A platform to display the carbon neutralization information for researchers, decision-makers, and other participants in the community.☆18Aug 16, 2022Updated 3 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 8 years ago
- ☆17Jul 1, 2020Updated 5 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Sparse kernels for GNNs based on TVM☆17Nov 18, 2020Updated 5 years ago
- ☆19Nov 10, 2024Updated last year
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 5 years ago
- PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks"☆14Mar 25, 2023Updated 3 years ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- GCNs Analysis: Visualization, Error Cases etc.☆14Feb 15, 2023Updated 3 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Apr 7, 2021Updated 5 years ago
- [SIGIR '22] Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Pr…☆18Sep 24, 2023Updated 2 years ago
- Project for Dynamic Capsule Attention☆12Dec 7, 2019Updated 6 years ago
- Batch MultiHead Graph Attention Pytorch☆12Apr 4, 2020Updated 6 years ago
- ☆16Mar 18, 2023Updated 3 years ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆18Jan 11, 2021Updated 5 years ago