ShuoTang123 / MATRIX
Implementation of the MATRIX framework (ICML 2024)
☆49Updated 11 months ago
Alternatives and similar repositories for MATRIX:
Users that are interested in MATRIX are comparing it to the libraries listed below
- ☆42Updated 5 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 4 months ago
- ☆25Updated 10 months ago
- Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆37Updated last month
- Paper List of Inference/Test Time Scaling/Computing☆181Updated last week
- [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications☆76Updated 3 weeks ago
- ☆32Updated 6 months ago
- [NAACL 25 Demo] TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)☆97Updated 2 weeks ago
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆16Updated 2 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆56Updated this week
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆72Updated 6 months ago
- ☆43Updated 2 months ago
- Accepted LLM Papers in NeurIPS 2024☆35Updated 6 months ago
- ☆88Updated 3 months ago
- AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection☆17Updated last year
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆110Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆74Updated 8 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆89Updated 9 months ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆28Updated 2 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆41Updated 4 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆70Updated this week
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆20Updated 2 months ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆41Updated 3 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆57Updated 6 months ago
- ICLR 2025 Agent-Related Papers☆62Updated 5 months ago
- ☆34Updated 6 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆45Updated last month
- ☆34Updated last month
- ☆9Updated 5 months ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆47Updated 7 months ago