Yuanhy1997 / HyPeLinks
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Updated 2 years ago
Alternatives and similar repositories for HyPe
Users that are interested in HyPe are comparing it to the libraries listed below
Sorting:
- Embedding Recycling for Language models☆38Updated 2 years ago
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated last week
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆37Updated last year
- ☆20Updated last year
- Learning to Model Editing Processes☆26Updated 3 years ago
- ☆22Updated 5 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆21Updated 2 weeks ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- ☆12Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- ☆26Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Updated last year
- A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.☆23Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization☆13Updated last year
- Repository for Skill Set Optimization☆14Updated 11 months ago
- Scaling Sparse Fine-Tuning to Large Language Models☆16Updated last year
- CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022)☆10Updated 3 years ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 5 months ago
- ☆11Updated 3 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- ☆14Updated 9 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Updated last year