wangclnlp / DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆21Updated last year
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users who are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below.
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆19Updated last year
- ☆23Updated 11 months ago
- This is the code of MMOA-RAG.☆101Updated 8 months ago
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models"☆26Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆186Updated last year
- Reinforced Multi-LLM Agents training☆69Updated 2 weeks ago
- ☆25Updated 9 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆24Updated 5 months ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆41Updated 9 months ago
- ☆51Updated last year
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆34Updated last year
- Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using GitHub Actions.☆135Updated this week
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆11Updated last year
- This is my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆38Updated 6 months ago
- The implementation of the paper "ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆57Updated 7 months ago
- ☆182Updated last week
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Updated last year
- ☆75Updated 2 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆65Updated 10 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆48Updated 4 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆52Updated 5 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆19Updated 7 months ago