This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆21Jul 2, 2024Updated last year
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.☆28Nov 8, 2025Updated 4 months ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated last year
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models"☆17Mar 11, 2025Updated last year
- ☆10Sep 18, 2021Updated 4 years ago
- ☆11Aug 15, 2023Updated 2 years ago
- ☆16Aug 20, 2021Updated 4 years ago
- ☆16Aug 25, 2021Updated 4 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆75Dec 8, 2025Updated 3 months ago
- Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24.☆14Feb 9, 2025Updated last year
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”☆16Jan 8, 2024Updated 2 years ago
- [NeurIPS 2024] Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method☆15Oct 1, 2024Updated last year
- 东北大学 C++课设/C课设 图书管理系统(最卷)☆10May 15, 2022Updated 3 years ago
- VHTest☆16Oct 31, 2024Updated last year
- Implementation of FedBary☆16Mar 24, 2025Updated 11 months ago
- A novel algorithm that distributes training-time model rewards to incentivize client contributions for federated learning. (ICLR-2024)☆18May 6, 2024Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆48Oct 16, 2025Updated 5 months ago
- 为了准备来年的蓝桥杯个人赛的一些常考的算法模板☆17Nov 30, 2019Updated 6 years ago
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning☆18Dec 26, 2024Updated last year
- This project based on Particle Swarm Optimization Algorithm. Try to solve Mobile Edge Computing optimization problem.☆11Jun 19, 2020Updated 5 years ago
- Official codes for "Training Deep Q-Network via Monte Carlo Tree Search for Adaptive Bitrate Control in Video Delivery"☆10Jul 21, 2023Updated 2 years ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- Sabre360: simulation testbed for 360° videos☆14Oct 14, 2020Updated 5 years ago
- Official Repository of Native Parallel Reasoner☆103Feb 5, 2026Updated last month
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- [TMLR23] FedDAG: Federated DAG Structure Learning☆19Jan 7, 2023Updated 3 years ago
- This is the formal code implementation of the CVPR 2024 paper 'Traceable Federated Continual Learning'.☆18May 31, 2024Updated last year
- ☆56Nov 12, 2024Updated last year
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation☆57Feb 2, 2026Updated last month
- Code for "Graph Contrastive Learning with Cohesive Subgraph Awareness"☆20Feb 29, 2024Updated 2 years ago
- VDPG: Adapting to Distribution Shift by Visual Domain Prompt Generation (ICLR 2024)☆18Jun 10, 2025Updated 9 months ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated 10 months ago
- This workshop covers the entire process of using Milvus—from installation and basic concepts to core operations and practical application…☆34Jan 8, 2026Updated 2 months ago
- The official code for the paper 'Towards Fair Graph Federated Learning via Incentive Mechanisms'☆17May 23, 2024Updated last year
- 此项目为论文《FedServing: A Federated Prediction Serving Framework Based on Incentive Mechanism》的验证项目。基于 intel SGX ,实现将各个不同模型的推测结果在可信硬件中使用 truth…☆18Oct 23, 2023Updated 2 years ago
- Forked from *OneIE: A Joint Neural Model for Information Extraction with Global Features*☆21Sep 4, 2022Updated 3 years ago
- [CVPR2024] Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts☆32Jun 18, 2024Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆103Aug 30, 2025Updated 6 months ago
- ☆34Jul 28, 2021Updated 4 years ago
- The classical algorithm for MIMO detection☆20Nov 4, 2019Updated 6 years ago