This repo contains extensions of DeepSpeed-Chat for fine-tuning LLMs (SFT + RLHF).
☆21 Jul 2, 2024 Updated last year
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below.
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… ☆123 Jun 18, 2025 Updated 11 months ago
- The project for speech translation ☆12 Sep 28, 2023 Updated 2 years ago
- Code for the Internship at NEU-NLP ☆21 Apr 18, 2023 Updated 3 years ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction ☆12 May 1, 2024 Updated 2 years ago
- ☆10 Sep 18, 2021 Updated 4 years ago
- ☆11 Aug 15, 2023 Updated 2 years ago
- (CVPR 2024) Uniformity and Variance for Heterogeneous Federated Learning ☆12 Mar 6, 2024 Updated 2 years ago
- ☆16 Aug 25, 2021 Updated 4 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆79 Dec 8, 2025 Updated 5 months ago
- Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24. ☆14 Feb 9, 2025 Updated last year
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining” ☆16 Jan 8, 2024 Updated 2 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers" ☆43 Mar 31, 2025 Updated last year
- VHTest ☆16 Oct 31, 2024 Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆50 Mar 31, 2026 Updated last month
- Official repository for FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning (ICCV2023) ☆20 Dec 1, 2023 Updated 2 years ago
- (CVPR 2024) FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning ☆20 Jun 21, 2024 Updated last year
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning ☆20 Dec 26, 2024 Updated last year
- This project is based on the Particle Swarm Optimization algorithm and tries to solve a Mobile Edge Computing optimization problem. ☆11 Jun 19, 2020 Updated 5 years ago
- Official codes for "Training Deep Q-Network via Monte Carlo Tree Search for Adaptive Bitrate Control in Video Delivery" ☆10 Jul 21, 2023 Updated 2 years ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment ☆35 Jul 1, 2024 Updated last year
- [TMLR23] FedDAG: Federated DAG Structure Learning ☆19 Jan 7, 2023 Updated 3 years ago
- ☆56 Nov 12, 2024 Updated last year
- Code for "Graph Contrastive Learning with Cohesive Subgraph Awareness" ☆20 Feb 29, 2024 Updated 2 years ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆68 Apr 6, 2026 Updated last month
- ☆25 Mar 15, 2023 Updated 3 years ago
- 🎮 A toolkit for Relation Extraction and more... ☆24 May 8, 2025 Updated last year
- ☆20 Sep 29, 2024 Updated last year
- Source code for paper: An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning ☆24 Sep 2, 2022 Updated 3 years ago
- ☆16 Sep 7, 2025 Updated 8 months ago
- [CVPR2024] Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts ☆31 Jun 18, 2024 Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆104 Apr 21, 2026 Updated last month
- M-HalDetect Dataset Release ☆29 Nov 4, 2023 Updated 2 years ago
- ☆34 Jul 28, 2021 Updated 4 years ago
- CVPR 2024 - Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity ☆28 May 28, 2024 Updated last year
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long) ☆65 Sep 28, 2024 Updated last year
- This is the code for paper "Correlation-aware Cooperative Multigroup Broadcast 360° Video Delivery Network: A Hierarchical Deep Reinforce… ☆29 Apr 3, 2021 Updated 5 years ago
- (CVPR 2024) Communication-Efficient Federated Learning with Accelerated Client Gradient ☆42 Aug 15, 2025 Updated 9 months ago
- The Official Repository for CVPR2023 Paper "NICO++: Towards Better Benchmarking for Domain Generalization". ☆42 Jul 29, 2023 Updated 2 years ago
- ☆51 Oct 29, 2023 Updated 2 years ago