Yuanhy1997 / HyPeLinks

HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]

☆14

Alternatives and similar repositories for HyPe

Users that are interested in HyPe are comparing it to the libraries listed below

Sorting:

allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated last week
renll / SeqBoat
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆37Updated last year
yikangshen / megablocks
☆20Updated last year
machelreid / editpro
Learning to Model Editing Processes
☆26Updated 3 years ago
salesforce / simplification
☆22Updated 5 months ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆21Updated 2 weeks ago
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 2 years ago
zhichaoxu-shufe / context-aware-decoding-qfs
☆12Updated last year
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
prateeky2806 / ComPEFT
☆26Updated last year
gonglinyuan / metro_t0
Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)
☆22Updated last year
whyNLP / Probabilistic-Transformer
A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.
☆23Updated last year
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Updated last year
Ankush7890 / ssfinetuning
A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning
☆14Updated 3 years ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆19Updated 2 years ago
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Updated last year
wangskyGit / passage-sieve
official repo of AAAI2024 paper Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization
☆13Updated last year
allenai / sso
Repository for Skill Set Optimization
☆14Updated 11 months ago
ducdauge / sft-llm
Scaling Sparse Fine-Tuning to Large Language Models
☆16Updated last year
cliang1453 / CAMERO
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022)
☆10Updated 3 years ago
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆16Updated 5 months ago
rycolab / probing-via-prompting
☆11Updated 3 years ago
srush / tangent
Source-to-Source Debuggable Derivatives in Pure Python
☆15Updated last year
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆48Updated last year
lucy3 / whos_filtered
☆14Updated 9 months ago
facebookresearch / ToolVerifier
This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.
☆20Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 5 months ago
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆15Updated last year