wangclnlp / DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆15 Updated 2 months ago
Related projects:
- This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing ☆9 Updated 6 months ago
- Code for EMNLP 2023 Findings paper: "Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Co… ☆12 Updated 11 months ago
- A paper reading list maintained by ICI-MT, contains all papers and PPTs (if available) shared by our groups. ☆10 Updated 6 months ago
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts ☆13 Updated 2 weeks ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data ☆10 Updated 3 weeks ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability" ☆29 Updated 2 months ago
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse' ☆11 Updated last month
- Code for Findings of EMNLP2023 paper "Coarse-to-Fine Dual Encoders are Better Frame Identification Learners" ☆12 Updated 11 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆28 Updated 8 months ago
- Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua… ☆14 Updated last year
- EMNLP 2022 RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees ☆11 Updated last year
- Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning Techniques for Code Generation with Large Language Models". ☆13 Updated last week
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression". ☆10 Updated 6 months ago
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning". ☆16 Updated last year
- PyTorch implementation of StableMask (ICML'24) ☆11 Updated 2 months ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning” ☆15 Updated 6 months ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models ☆13 Updated 10 months ago
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin… ☆10 Updated 6 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning ☆36 Updated last year
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents" ☆14 Updated last week
- EMNLP 2023 Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts ☆23 Updated 10 months ago