archinetai / vat-pytorch
Virtual Adversarial Training (VAT) techniques in PyTorch
☆17Updated 2 years ago
Alternatives and similar repositories for vat-pytorch:
Users that are interested in vat-pytorch are comparing it to the libraries listed below
- ☆10Updated 3 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- A visualizer to display attention weights on text☆23Updated 6 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆31Updated 2 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆45Updated 4 years ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 6 months ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Updated 2 years ago
- ☆16Updated 9 months ago
- ☆11Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆39Updated 4 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Updated 10 months ago
- ☆16Updated 3 years ago
- Directed masked autoencoders☆14Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated 11 months ago
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data☆56Updated 3 years ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆42Updated last year
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Updated 2 years ago
- ☆19Updated 2 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- Pretraining summarization models using a corpus of nonsense☆13Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated last year
- Code for the PAPA paper☆27Updated 2 years ago
- Robust Self-augmentation for NER with Meta-reweighting☆29Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆27Updated last year
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago