Codebase for decoding compressed trust.
☆25May 7, 2024Updated last year
Alternatives and similar repositories for comp-trust
Users who are interested in comp-trust are comparing it to the libraries listed below
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- This is the repository for our paper published at ICML24.☆11Jun 11, 2025Updated 9 months ago
- Learning adapter weights from task descriptions☆19Nov 12, 2023Updated 2 years ago
- End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP☆16Sep 23, 2024Updated last year
- Implementation for NeurIPS 2024 oral paper: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation☆16Jan 27, 2025Updated last year
- ☆33Jun 24, 2024Updated last year
- [ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images☆43Jan 25, 2024Updated 2 years ago
- ☆10Jun 19, 2023Updated 2 years ago
- A Comprehensive Assessment of Trustworthiness in GPT Models☆314Sep 16, 2024Updated last year
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)☆20Nov 8, 2023Updated 2 years ago
- ☆43May 23, 2023Updated 2 years ago
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆65Aug 10, 2025Updated 7 months ago
- ☆16Oct 11, 2023Updated 2 years ago
- ☆23May 25, 2023Updated 2 years ago
- ☆28Feb 27, 2025Updated last year
- direct preference optimization with only 1 model copy :)☆14Oct 2, 2023Updated 2 years ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk☆16Mar 15, 2025Updated last year
- ☆16Jul 17, 2025Updated 8 months ago
- Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei …☆35Jun 3, 2025Updated 9 months ago
- ☆20Feb 3, 2025Updated last year
- ☆24Jan 28, 2025Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- Large Scale BERT Distillation☆33Mar 24, 2023Updated 2 years ago
- [NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.☆188Apr 1, 2025Updated 11 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- Official implementation of the paper "Increasing Confidence in Adversarial Robustness Evaluations"☆20Updated this week
- Application and blog explaining my interpretations of In-run Data Shapley☆26Jan 30, 2025Updated last year
- Official codes for "Understanding Deep Gradient Leakage via Inversion Influence Functions", NeurIPS 2023☆15Oct 13, 2023Updated 2 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Code for CVPR2018 "Iterative Learning with Open-set Noisy Labels"☆12Mar 12, 2021Updated 5 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆38May 26, 2025Updated 9 months ago
- [ICLR 2022] Understanding and Improving Graph Injection Attack by Promoting Unnoticeability☆38Nov 27, 2023Updated 2 years ago
- Official code of the paper "A Stealthy Wrongdoer: Feature-Oriented Reconstruction Attack against Split Learning".☆15Sep 11, 2024Updated last year
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 2 years ago
- [CCS-LAMPS'24] LLM IP Protection Against Model Merging☆16Oct 14, 2024Updated last year
- SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025☆55Mar 1, 2025Updated last year
- Official codes for FPR (Accepted by CVPR2025)☆14Mar 19, 2025Updated last year
- Adversarial Item Promotion in visually-aware recommenders☆16Sep 3, 2021Updated 4 years ago