Implementation of the paper Parameter-Efficient Transfer Learning for NLP, Houlsby [Google], 2019. Published in ICML 2019.
☆34Jun 12, 2023Updated 3 years ago
Alternatives and similar repositories for bert_adapter
Users that are interested in bert_adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Filter dialog data with a simple entropy-based method (see ACL paper)☆14Oct 4, 2019Updated 6 years ago
- Automated neural architecture search algorithms implemented in PyTorch and Autogluon toolkit.☆12Apr 17, 2020Updated 6 years ago
- ☆505Oct 25, 2023Updated 2 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- ☆21May 14, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19May 11, 2021Updated 5 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- ☆26Jan 27, 2018Updated 8 years ago
- Code of ImageNet training and evaluation for the paper: RENAS: Reinforced Evolutionary Neural Architecture Search☆20May 15, 2019Updated 7 years ago
- TOWARDS AN AUTOMATIC TURING TEST: LEARNING TO EVALUATE DIALOGUE RESPONSES☆30Aug 25, 2017Updated 8 years ago
- [KDD'22] Learned Token Pruning for Transformers☆98Feb 27, 2023Updated 3 years ago
- [MM 2023 Oral] Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation☆17Jan 10, 2024Updated 2 years ago
- This repositorie es the code of the paper Optimizing Reusable Knowledge for Continual Learning via Metalearning.☆11Oct 12, 2021Updated 4 years ago
- Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"☆31Apr 17, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- code and resources for our paper "Achieving Joint Training Accuracy in Continual Learning" in AAAI2025☆14Feb 25, 2025Updated last year
- ☆21Apr 27, 2023Updated 3 years ago
- How Does Selective Mechanism Improve Self-attention Networks?☆29Mar 16, 2021Updated 5 years ago
- PyTorch port of "Efficient Neural Architecture Search via Parameters Sharing"☆53Jun 7, 2018Updated 8 years ago
- The inplementation of Hierarchical Recurrent Attention Network☆38Oct 30, 2025Updated 7 months ago
- 2020语言与智能技术竞赛:面向推荐的对话任务☆51Jun 17, 2021Updated 4 years ago
- ☆13Nov 19, 2022Updated 3 years ago
- This is a curated list of "Continual Learning with Pretrained Models" research.☆20May 29, 2025Updated last year
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆15Oct 22, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A good example of deformable convolutional network for mnist classification☆21Oct 15, 2019Updated 6 years ago
- Pytorch implementation of DiffMask☆58Jun 12, 2023Updated 3 years ago
- Code for the "Evolving Reservoirs for Meta Reinforcement Learning" paper☆12Apr 22, 2024Updated 2 years ago
- 一个针对中文聊天机器人的公开数据集☆11Sep 11, 2019Updated 6 years ago
- 基于c++ muduo网络库的集群聊天服务器,使用nginx实现负载均衡,使用reids消息队列实现跨服务器通信☆12Feb 23, 2024Updated 2 years ago
- ☆18May 10, 2023Updated 3 years ago
- 华为哪吒模型实践指南☆21Jan 7, 2022Updated 4 years ago
- Ladder Side-Tuning在CLUE上的简单尝试☆23Jun 20, 2022Updated 3 years ago
- A Multi-tasking and Multi-stage Chinese Minority Pre-Trained Language Model☆12Jul 24, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆16Nov 25, 2025Updated 6 months ago
- Crawler used to crawl papers☆25Oct 24, 2018Updated 7 years ago
- D ratio is a performance metric to analyse the efficiency of algorithms that predict asset return or asset prices☆25Feb 22, 2024Updated 2 years ago
- Models and training scripts for the English, German and Russian MAGEC systems described in R. Grundkiewicz, M. Junczys-Dowmunt: Minimally…☆12Jul 7, 2021Updated 4 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- Information extraction dataset zoo.☆43Apr 10, 2022Updated 4 years ago
- ☆17Jun 21, 2024Updated last year