wangclnlp / DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆17Updated 6 months ago
Alternatives and similar repositories for DeepSpeed-Chat-Extension:
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆18Updated 8 months ago
- A hot-pluggable tool for visualizing LLaVA's attention.☆13Updated last year
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆15Updated 6 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆65Updated 4 months ago
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆16Updated 7 months ago
- ☆62Updated last year
- A Survey on the Honesty of Large Language Models☆51Updated last month
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆10Updated 3 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆32Updated 6 months ago
- [EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering☆16Updated 3 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆29Updated 2 weeks ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆161Updated 11 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆15Updated 2 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆11Updated 7 months ago
- LLM Unlearning☆141Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆29Updated 2 months ago
- ☆37Updated 3 months ago
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆40Updated last year
- ☆85Updated 4 months ago
- This repo is reproduction resources for linear alignment paper, still working☆17Updated 8 months ago
- [ACL 2023] Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation☆13Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆34Updated last year
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆13Updated 3 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆77Updated 11 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆62Updated 11 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆41Updated 3 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆106Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆100Updated 4 months ago
- The official code repository for PRMBench.☆60Updated last week
- ☆30Updated 11 months ago