GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models
☆19Jul 12, 2023Updated 2 years ago
Alternatives and similar repositories for GRAIN
Users that are interested in GRAIN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- ☆23Nov 26, 2024Updated last year
- A PyTorch-based model pruning toolkit for pre-trained language models☆389Aug 31, 2023Updated 2 years ago
- ☆12Oct 9, 2023Updated 2 years ago
- Rust bindings for Kubernetes Container Storage Interface generated from Protobuf using Tonic/Prost☆14Aug 4, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)☆13Feb 7, 2026Updated 2 months ago
- ConstraintRadioGroup☆11Sep 16, 2020Updated 5 years ago
- ☆14May 21, 2024Updated last year
- ☆30Jul 22, 2024Updated last year
- Radio Group implementation without the need for nested layout. To be mainly used with ConstraintLayout☆12Jun 25, 2019Updated 6 years ago
- ☆13Jun 2, 2022Updated 3 years ago
- The implementation of Text Classification with Negative Supervision (ACL, 2020)☆10Oct 8, 2020Updated 5 years ago
- bert分类pytorch版本☆11Apr 14, 2021Updated 5 years ago
- 利用BERT预训练模型进行文本生成,可用于对话、摘要、问题生成等任务。 目前支持策略,词表的插入和删除、自定义Character Embedding、随机词替换等☆10Jun 1, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- TPU + GPU,基于疫情期间网民微博评论的情感分析项目☆10Jul 17, 2024Updated last year
- 利用大语言模型进行卧底游戏,包括谁是卧底及衍生的发现AI卧底游戏等。☆11Sep 6, 2024Updated last year
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- AutodiffEngine☆13Apr 1, 2019Updated 7 years ago
- Extreme Multi-label Text Classification based on X-BERT with GCN and Clustering modules☆11Nov 10, 2019Updated 6 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- ☆13Sep 6, 2022Updated 3 years ago
- 高质量闲聊数据介绍☆30Dec 12, 2018Updated 7 years ago
- An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.☆25Aug 6, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 基于rust语言开发的一套运维监控探针,支持widnows、linux 、macos系统☆31Nov 16, 2021Updated 4 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- Introduction to Rust - 构建 Rust 语言的完整知识体系☆22Dec 30, 2025Updated 3 months ago
- A lightweight, high-performance message queue implementation using Rust.☆34Jan 23, 2025Updated last year
- 微信集赞,朋友圈集赞,支持多种集赞模式,快速生成1000赞。☆18Mar 4, 2023Updated 3 years ago
- ☆14Oct 21, 2020Updated 5 years ago
- [CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability☆32Apr 2, 2026Updated 2 weeks ago
- Implementation of paper "Transferring Robustness for Graph Neural Network Against Poisoning Attacks".☆20Feb 26, 2020Updated 6 years ago
- Repository for the paper "Automating App Review Response Generation"☆11Nov 16, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- NLP - Semantic Role Labeling using GCN, Bert and Biaffine Attention Layer. Developed in Pytorch☆13Sep 12, 2020Updated 5 years ago
- ☆34Mar 28, 2025Updated last year
- 此仓库代码为本人参加的CCF-BDCI-2022 赛道:Web攻击检测与分类识别 (多分类任务),比赛rank-23。队员:Furen Xu☆15Feb 6, 2023Updated 3 years ago
- nlp包括对话的数据集收集整理☆14Mar 8, 2020Updated 6 years ago
- 一些纪录☆11May 22, 2017Updated 8 years ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆63Mar 21, 2026Updated last month
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated 2 years ago