adihaviv / idiomemLinks

☆9

Alternatives and similar repositories for idiomem

Users that are interested in idiomem are comparing it to the libraries listed below

Sorting:

declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆28Updated last year
dannyallover / overthinking_the_truth
☆29Updated last year
SumilerGAO / SunGen
☆27Updated 2 years ago
yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆29Updated last year
MaheepChaudhary / SAE-Ravel
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆11Updated 5 months ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆27Updated 9 months ago
yasumasaonoe / entity_knowledge_propagation
☆17Updated last year
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38Updated 2 years ago
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆60Updated 7 months ago
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆19Updated last year
tml-epfl / long-is-more-for-alignment
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]
☆17Updated last year
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆75Updated last year
JasonForJoy / Model-Editing-Hurt
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆35Updated last month
googleinterns / localizing-paragraph-memorization
☆14Updated last year
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆68Updated 2 years ago
SteveKGYang / MetaAligner
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆19Updated 9 months ago
PrasannS / rlhf-length-biases
☆28Updated last year
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆23Updated 11 months ago
ChaosCodes / ProPETL
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
☆39Updated last year
explanare / ravel
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆47Updated 8 months ago
deeplearning-wisc / args
☆40Updated last year
shadowkiller33 / Contrast-Instruction
☆19Updated last year
princeton-nlp / MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975
☆38Updated last year
hannamw / EAP-IG
☆37Updated last month
Zce1112zslx / IKE
☆41Updated last year
THU-KEG / Skill-Neuron
Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".
☆18Updated 2 years ago
yanaiela / pararel
☆44Updated last year
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆26Updated last year
Princeton-SysML / kNNLM_privacy
Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888
☆35Updated last year