Yangyi-Chen / PaperList-Trustworthy-ApplicationsLinks
Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, calibration, backdoor learning, robustness, et al.
☆21Updated 2 years ago
Alternatives and similar repositories for PaperList-Trustworthy-Applications
Users that are interested in PaperList-Trustworthy-Applications are comparing it to the libraries listed below
Sorting:
- ☆42Updated last year
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆89Updated last year
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆100Updated 6 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning