Jihuai-wpy / InferAligner
☆23Updated 7 months ago
Related projects: ⓘ
- ☆27Updated 3 months ago
- ☆110Updated last month
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆144Updated 7 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆109Updated last week
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆27Updated 7 months ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.☆140Updated 5 months ago
- The reinforcement learning codes for dataset SPA-VL☆15Updated 2 months ago
- UniGen: A Unified Framework for Dataset Generation via Large Language Model☆21Updated 2 weeks ago
- ☆28Updated 7 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆79Updated 3 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆81Updated this week
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 4 months ago
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆64Updated 2 weeks ago
- ☆71Updated 8 months ago
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models☆22Updated 11 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆56Updated 6 months ago
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models☆32Updated 2 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆54Updated 9 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆28Updated 5 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models☆50Updated 2 months ago
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆34Updated this week
- The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆19Updated last month
- 【ACL 2024】 SALAD benchmark & MD-Judge☆81Updated this week
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Updated 2 months ago
- Accepted by IJCAI-24 Survey Track☆117Updated 3 weeks ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆84Updated 5 months ago
- ☆14Updated 4 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆29Updated 2 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆62Updated 3 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆62Updated 7 months ago