wangclnlp / DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆18Updated 9 months ago
Alternatives and similar repositories for DeepSpeed-Chat-Extension:
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆30Updated 4 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆33Updated 2 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆72Updated 6 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆13Updated 9 months ago
- ☆70Updated 3 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆37Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆111Updated 6 months ago
- [arxiv:2412.04905] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆13Updated 3 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆28Updated last week
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆18Updated 10 months ago
- ☆12Updated 2 months ago
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆17Updated 9 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆79Updated last year
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆35Updated 7 months ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆25Updated 5 months ago
- Public code repo for paper "Aligning LLMs with Individual Preferences via Interaction"☆24Updated 5 months ago
- Language Imbalance Driven Rewarding for Multilingual Self-improving☆15Updated 5 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated last year
- ☆69Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆104Updated 5 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆44Updated 3 weeks ago
- ☆43Updated 9 months ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆30Updated 9 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆58Updated 11 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆107Updated last year
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆66Updated last year
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation (NAACL 2024)☆11Updated 3 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆39Updated last year
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆43Updated 3 weeks ago
- ☆43Updated 5 months ago