Unofficial Pytorch implementation of MiniLM and MiniLMv2
☆23 · Jan 30, 2022 · Updated 4 years ago
Alternatives and similar repositories for Pytorch-MiniLM
Users that are interested in Pytorch-MiniLM are comparing it to the libraries listed below.
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh… ☆11 · Nov 22, 2022 · Updated 3 years ago
- This repository is the official PyTorch implementation of "Distilling Linguistic Context for Language Model Compression" by GeondoPark, G… ☆35 · Dec 3, 2021 · Updated 4 years ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models" ☆41 · Aug 9, 2022 · Updated 3 years ago
- Code for EMNLP 2023 findings paper "A Closer Look into Using Large Language Models for Automatic Evaluation" ☆19 · Oct 9, 2023 · Updated 2 years ago
- [AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan ☆14 · Oct 18, 2022 · Updated 3 years ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆256 · Mar 13, 2025 · Updated last year
- Polyreactivity Website ☆21 · Jun 26, 2023 · Updated 2 years ago
- ☆11 · Apr 19, 2021 · Updated 4 years ago
- Code for NeurIPS 2024 paper "A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models" ☆15 · Oct 17, 2024 · Updated last year
- ☆22 · Dec 11, 2024 · Updated last year
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ☆22 · Feb 28, 2026 · Updated last month
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe… ☆15 · Oct 18, 2022 · Updated 3 years ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs. ☆29 · Oct 18, 2024 · Updated last year
- Learning Memory Access Pattern ☆12 · Feb 28, 2019 · Updated 7 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models ☆10 · Mar 22, 2018 · Updated 8 years ago
- PR2-specific functionality related to pickup and place tasks. ☆20 · Aug 28, 2013 · Updated 12 years ago
- Large-scale exact string matching tool ☆17 · Mar 7, 2025 · Updated last year
- ☆11 · Jun 14, 2019 · Updated 6 years ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long) ☆65 · Sep 28, 2024 · Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 · Sep 23, 2020 · Updated 5 years ago
- An official implementation of ProbeGen ☆13 · Oct 20, 2024 · Updated last year
- Here is my implementation of Center Loss with Keras ☆11 · May 2, 2018 · Updated 7 years ago
- Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation ☆16 · Feb 7, 2022 · Updated 4 years ago
- A curated list of few-shot learning in NLP. :-) ☆65 · Oct 30, 2021 · Updated 4 years ago
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022) ☆20 · Jul 18, 2022 · Updated 3 years ago
- bumble bee transformer ☆14 · Apr 19, 2021 · Updated 4 years ago
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021) ☆19 · Jul 28, 2021 · Updated 4 years ago
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023) ☆21 · Oct 28, 2023 · Updated 2 years ago
- An open-chat service w/ some hip, happenin' UI—mostly for seminar Q&As. ☆13 · Apr 21, 2019 · Updated 6 years ago
- python package for self-attention gan implemented as extension of PyTorch nn.Module. paper -> https://arxiv.org/abs/1805.08318 ☆19 · Sep 14, 2018 · Updated 7 years ago
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification" ☆42 · Nov 15, 2020 · Updated 5 years ago
- A hands-on tutorial on how to use Active Learning with Transformer models. ☆15 · Oct 3, 2021 · Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining ☆12 · Mar 23, 2021 · Updated 5 years ago
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators" ☆26 · Sep 11, 2024 · Updated last year
- ☆13 · Mar 6, 2023 · Updated 3 years ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning ☆14 · Jun 3, 2025 · Updated 10 months ago
- Official Implementation of Curriculum of Data Augmentation for Long-tailed Recognition (CUDA) (ICLR'23 Spotlight) ☆23 · May 26, 2023 · Updated 2 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression ☆204 · Sep 20, 2019 · Updated 6 years ago
- ☆32 · Mar 13, 2024 · Updated 2 years ago