Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14 Updated last year
Related projects
Alternatives and complementary repositories for HyPe
- Minimum Description Length probing for neural network representations ☆16 Updated last week
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆18 Updated 2 months ago
- Embedding Recycling for Language models ☆38 Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆29 Updated this week
- Adding new tasks to T0 without catastrophic forgetting ☆30 Updated 2 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22 ☆17 Updated last year
- Tasks for describing differences between text distributions. ☆16 Updated 3 months ago
- Repository for Skill Set Optimization ☆12 Updated 3 months ago
- ☆18 Updated 5 months ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 Updated 2 years ago
- Learning to Model Editing Processes ☆26 Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆14 Updated 3 years ago
- ☆31 Updated 10 months ago
- Few-shot Learning with Auxiliary Data ☆26 Updated 11 months ago
- ☆25 Updated 11 months ago
- ☆19 Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆16 Updated last year
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling ☆35 Updated 11 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆44 Updated 11 months ago
- Code for "Natural Language to Code Translation with Execution" ☆39 Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆26 Updated last year
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering ☆16 Updated last year
- A package for fine-tuning pretrained NLP transformers using Semi-Supervised Learning ☆15 Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation ☆26 Updated 5 months ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees ☆23 Updated last year
- ☆26 Updated 4 months ago
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 Updated 6 months ago
- ☆11 Updated 2 years ago