☆45Sep 12, 2021Updated 4 years ago
Alternatives and similar repositories for mt5-soft-prompt-tuning
Users that are interested in mt5-soft-prompt-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prompt tuning toolkit for GPT-2 and GPT-Neo☆90Sep 27, 2021Updated 4 years ago
- Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.☆32Apr 26, 2021Updated 5 years ago
- Overwrite huggingface BART and GPT with copy mechanism☆21May 3, 2023Updated 3 years ago
- Code for AAAI 2022 paper Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives☆23Jun 14, 2022Updated 3 years ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Feb 14, 2023Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- ☆31Jun 1, 2023Updated 2 years ago
- Concealed Data Poisoning Attacks on NLP Models☆21Sep 4, 2023Updated 2 years ago
- 基于PaddleNLP开源的抽取式UIE进行医学命名实体识别(torch实现)☆44Aug 5, 2022Updated 3 years ago
- Code for "Sample-efficient Deep Reinforcement Learning of Mobile Manipulation for 6-DOF Trajectory Following"☆13Mar 19, 2025Updated last year
- Introductory lecture on Pytorch☆17May 8, 2022Updated 4 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆167Sep 8, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 文言文信息抽取(实体识别+关系抽取)☆10Feb 24, 2023Updated 3 years ago
- 2018 高校算法大赛神策杯第五名解决方案☆18Oct 22, 2018Updated 7 years ago
- Code for KE-Blender, EMNLP 2021☆18Mar 1, 2022Updated 4 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 4 years ago
- ☆10Aug 14, 2020Updated 5 years ago
- CCKS2020面向金融领域的小样本跨类迁移事件抽取baseline☆56Dec 8, 2022Updated 3 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Aug 19, 2022Updated 3 years ago
- Code for our ACL 2023 paper: Causality-aware Concept Extraction based on Knowledge-guided Prompting☆14Aug 19, 2023Updated 2 years ago
- Covid Doctor chatbot using DialoGPT☆13May 25, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- ☆10Aug 16, 2022Updated 3 years ago
- some baselines for lic2020 (http://lic2020.cipsc.org.cn/)☆218Apr 11, 2020Updated 6 years ago
- 基于span分类和负采样的嵌套实体识别☆13Mar 9, 2023Updated 3 years ago
- Fine-grained Gating for Reading Comprehension☆19Sep 12, 2017Updated 8 years ago
- Finetune Bloom big language model with Lora method☆32Jun 9, 2023Updated 2 years ago
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- The source code of paper "PAIR-LEVEL SUPERVISED CONTRASTIVE LEARNING FOR NATURAL LANGUAGE INFERENCE" at ICASSP 2022.☆46May 15, 2023Updated 2 years ago
- SimCSE☆15Oct 1, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆59Jun 27, 2022Updated 3 years ago
- Official Code for paper "FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction" (RecSys …☆18Jul 23, 2024Updated last year
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated last year
- Code for paper Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation☆14Jun 10, 2022Updated 3 years ago
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆962Apr 26, 2024Updated 2 years ago
- Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".☆23Feb 21, 2021Updated 5 years ago
- [COLING22] An End-to-End Library for Evaluating Natural Language Generation☆94Dec 18, 2023Updated 2 years ago