decoding-comp-trust/comp-trust

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/decoding-comp-trust/comp-trust)

decoding-comp-trust / comp-trust

Codebase for decoding compressed trust.

☆27

Alternatives and similar repositories for comp-trust

Users that are interested in comp-trust are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liutianlin0121 / decoding-time-realignment
View on GitHub
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated 2 years ago
KaidiXu / LiRPA_Verify
View on GitHub
Code for paper "Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers"
☆17Jan 27, 2023Updated 3 years ago
cometeme / funcoder
View on GitHub
Implementation for NeurIPS 2024 oral paper: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
☆16Jan 27, 2025Updated last year
jyhong836 / llm-dp-finetune
View on GitHub
End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP
☆17Sep 23, 2024Updated last year
zhxieml / remiss-jailbreak
View on GitHub
☆33Jun 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AI-secure / DecodingTrust
View on GitHub
A Comprehensive Assessment of Trustworthiness in GPT Models
☆314Sep 16, 2024Updated last year
simonucl / PolySkill
View on GitHub
Official implementation of PolySkill, a framework that enables web agents to learn generalizable and compositional skills through polymor…
☆15Jul 6, 2026Updated 2 weeks ago
illidanlab / FADE
View on GitHub
[KDD2021] Federated Adversarial Debiasing for Fair and Transferable Representations: Optimize an adversarial domain-adaptation objective …
☆26Feb 23, 2023Updated 3 years ago
weichen-yu / LM-Extraction
View on GitHub
☆43May 23, 2023Updated 3 years ago
CHATS-lab / VibeLens
View on GitHub
Your agent is powerful but it doesn't know you. VibeLens visualizes agent sessions, personalizes your agents, provides dashboard analytic…
☆18Updated this week
tomhosking / hercules
View on GitHub
Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)
☆20Nov 8, 2023Updated 2 years ago
git-disl / Safety-Tax
View on GitHub
This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".
☆35Mar 11, 2025Updated last year
andersonbcdefg / dpo-lora
View on GitHub
direct preference optimization with only 1 model copy :)
☆14Oct 2, 2023Updated 2 years ago
VITA-Group / Shake-to-Leak
View on GitHub
[SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk
☆16Mar 15, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hanxuhu / chain-of-symbol-planning
View on GitHub
☆23May 25, 2023Updated 3 years ago
hedongxiao-tju / NSLM
View on GitHub
Code & data accompanying the paper ["Unveiling Implicit Deceptive Patterns in Multi-modal Fake News via Neuro-Symbolic Reasoning"].
☆13Dec 21, 2023Updated 2 years ago
zchuz / TimeBench
View on GitHub
The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"
☆36Jun 29, 2024Updated 2 years ago
e-commerce-search / bert2dnn
View on GitHub
Large Scale BERT Distillation
☆33Mar 24, 2023Updated 3 years ago
liuzrcc / AIP
View on GitHub
Adversarial Item Promotion in visually-aware recommenders
☆17Sep 3, 2021Updated 4 years ago
UCSB-NLP-Chang / SelfDenoise
View on GitHub
☆14May 7, 2024Updated 2 years ago
tmlr-group / DAL
View on GitHub
[NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"
☆11Nov 14, 2023Updated 2 years ago
CryptoAILab / JailbreakEval
View on GitHub
[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
☆193Apr 1, 2025Updated last year
JasonForJoy / Model-Editing-Hurt
View on GitHub
EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
☆37May 26, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
adamkarvonen / SAE_BoardGameEval
View on GitHub
☆25Jan 28, 2025Updated last year
pixeli99 / OWS
View on GitHub
Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei …
☆35Jun 3, 2025Updated last year
HKBUNLP / Mr.Harm-EMNLP2023
View on GitHub
Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…
☆15May 5, 2024Updated 2 years ago
kundank78 / SpatialTransformer
View on GitHub
Implementation of Spatial Transformer Networks in Pytorch.
☆14May 13, 2023Updated 3 years ago
illidanlab / inversion-influence-function
View on GitHub
Official codes for "Understanding Deep Gradient Leakage via Inversion Influence Functions", NeurIPS 2023
☆16Oct 13, 2023Updated 2 years ago
CryptoAILab / MergeGuard
View on GitHub
[CCS-LAMPS'24] LLM IP Protection Against Model Merging
☆16Oct 14, 2024Updated last year
UNITES-Lab / Occult
View on GitHub
[ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13Apr 17, 2025Updated last year
swj0419 / detect-pretrain-code
View on GitHub
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆243Nov 3, 2023Updated 2 years ago
yfchen1994 / poisoning_membership
View on GitHub
☆20Oct 28, 2025Updated 8 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
amiya-special / AutoMIA
View on GitHub
☆15Apr 3, 2026Updated 3 months ago
LFhase / GIA-HAO
View on GitHub
[ICLR 2022] Understanding and Improving Graph Injection Attack by Promoting Unnoticeability
☆38Nov 27, 2023Updated 2 years ago
Breakend / SelfDestructingModels
View on GitHub
☆14Aug 9, 2023Updated 2 years ago
UCSB-NLP-Chang / Fairness-Reprogramming
View on GitHub
☆16Oct 16, 2023Updated 2 years ago
illidanlab / ABD
View on GitHub
[ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers
☆24Jul 7, 2024Updated 2 years ago
Sreyan88 / ACLM
View on GitHub
Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
☆22Jul 19, 2023Updated 3 years ago
RunpeiDong / DGMS
View on GitHub
[ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
☆11May 21, 2023Updated 3 years ago