archinetai / vat-pytorch
Virtual Adversarial Training (VAT) techniques in PyTorch
☆16Updated 2 years ago
Related projects: ⓘ
- ☆10Updated 2 years ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆24Updated last year
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr…☆24Updated 2 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆33Updated 3 years ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆30Updated last year
- ☆48Updated last year
- Source code for SIGIR 2022 paper.☆15Updated 2 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆15Updated last year
- MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information (WSDM'22)☆12Updated 5 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆27Updated this week
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆11Updated 3 months ago
- Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings? (AAAI 2021)☆9Updated 3 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆28Updated 4 months ago
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Updated 2 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆14Updated last year
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆38Updated last year
- ☆11Updated last year
- The codebase for the paper: A Closer Look at How Fine-tuning Changes BERT☆20Updated last year
- Lite Self-Training☆29Updated last year
- ☆18Updated 3 years ago
- ☆36Updated last month
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆22Updated 3 years ago
- This is a code repository for the ACL 2022 paper "ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generati…☆28Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆11Updated last year
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Updated 2 years ago