moqingyan / dsr-lmLinks

☆11

Alternatives and similar repositories for dsr-lm

Users that are interested in dsr-lm are comparing it to the libraries listed below

Sorting:

zh1yu4nyu / CodeIPPrompt
https://icml.cc/virtual/2023/poster/24354
☆10Updated 2 years ago
MadryLab / modeldiff
ModelDiff: A Framework for Comparing Learning Algorithms
☆58Updated 2 years ago
McGill-NLP / AdversarialTriggers
TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
☆19Updated 3 months ago
ethz-spylab / superhuman-ai-consistency
☆30Updated 2 years ago
xiye17 / SAT-LM
SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023)
☆50Updated last year
ApolloResearch / deception-detection
☆24Updated 9 months ago
Cranial-XIX / Continual-Learning-Private-Unlearning
Official PyTorch Implementation for Continual Learning and Private Unlearning
☆17Updated 3 years ago
MadryLab / datamodels-data
Data for "Datamodels: Predicting Predictions with Training Data"
☆97Updated 2 years ago
yuchen814 / CodeHalu
☆17Updated last year
weichen-yu / LM-Extraction
☆43Updated 2 years ago
thestephencasper / explore_establish_exploit_llms
☆31Updated 2 years ago
MadryLab / failure-directions
Distilling Model Failures as Directions in Latent Space
☆47Updated 2 years ago
joshuacnf / paradox-learning2reason
☆36Updated 11 months ago
azshue / AutoPoison
The official repository of the paper "On the Exploitability of Instruction Tuning".
☆65Updated last year
ejones313 / auditing-llms
☆59Updated 2 years ago
shauli-ravfogel / adv-kernel-removal
☆12Updated 3 years ago
locuslab / acr-memorization
☆37Updated 11 months ago
csfaculty / csfaculty.github.io
Interview questions for Computer Science faculty jobs
☆40Updated last year
crux-eval / eval-arena
☆28Updated 2 weeks ago
google-deepmind / distribution_shift_framework
This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles e…
☆84Updated 3 weeks ago
aw31 / empirical-ntks
Efficient empirical NTKs in PyTorch
☆22Updated 3 years ago
thestephencasper / latent_adversarial_training
☆23Updated last year
pratyushmaini / localizing-memorization
Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"
☆20Updated 2 years ago
shauli-ravfogel / rlace-icml
☆36Updated 3 years ago
Yujun-Yan / Neural-Execution-Engines
Code for Neural Execution Engines: Learning to Execute Subroutines
☆17Updated 4 years ago
agiresearch / TrustAgent
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
☆54Updated 9 months ago
MadryLab / pretraining-distribution-shift-robustness
☆14Updated last year
arobey1 / advbench
☆44Updated 2 years ago
reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆117Updated last year
ybjiaang / ACTIR
Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts"
☆16Updated 3 years ago