This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆21Jul 2, 2024Updated last year
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆121Jun 18, 2025Updated 10 months ago
- A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.☆28Nov 8, 2025Updated 5 months ago
- Code for the Internship at NEU-NLP☆21Apr 18, 2023Updated 3 years ago
- 中文原生工业测评基准☆15Mar 21, 2024Updated 2 years ago
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models"☆17Mar 11, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Sep 18, 2021Updated 4 years ago
- ☆11Aug 15, 2023Updated 2 years ago
- (CVPR 2024) Uniformity and Variance for Heterogeneous Federated Learning☆12Mar 6, 2024Updated 2 years ago
- ☆16Aug 25, 2021Updated 4 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆79Dec 8, 2025Updated 4 months ago
- Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24.☆14Feb 9, 2025Updated last year
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”☆16Jan 8, 2024Updated 2 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆43Mar 31, 2025Updated last year
- VHTest☆16Oct 31, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆50Mar 31, 2026Updated last month
- Simple implementation of Retrieval-Augmented Generation System☆29Oct 24, 2024Updated last year
- Official repository for FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning (ICCV2023)☆20Dec 1, 2023Updated 2 years ago
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning☆20Dec 26, 2024Updated last year
- An easy tool to transcode 360 VR videos to tile-based streamable MPEG-DASH 360 VR segment sets.☆14Jan 22, 2021Updated 5 years ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL☆107Feb 5, 2026Updated 2 months ago
- [TMLR23] FedDAG: Federated DAG Structure Learning☆19Jan 7, 2023Updated 3 years ago
- This is the formal code implementation of the CVPR 2024 paper 'Traceable Federated Continual Learning'.☆18May 31, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆56Nov 12, 2024Updated last year
- Code for "Graph Contrastive Learning with Cohesive Subgraph Awareness"☆20Feb 29, 2024Updated 2 years ago
- Code repository for the paper OpCASH: Optimized Utilization of MEC Cache for 360-Degree Video Streaming with Dynamic Tiling☆12Jan 26, 2022Updated 4 years ago
- The official code for the paper 'Towards Fair Graph Federated Learning via Incentive Mechanisms'☆17May 23, 2024Updated last year
- SALI360: Design and Implementation of Saliency based Video Compression for 360 Video Streaming☆13Sep 27, 2021Updated 4 years ago
- Source code for paper: An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning☆24Sep 2, 2022Updated 3 years ago
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- Forked from *OneIE: A Joint Neural Model for Information Extraction with Global Features*☆21Sep 4, 2022Updated 3 years ago
- ☆16Sep 7, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR2024] Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts☆31Jun 18, 2024Updated last year
- Official Implementation of FedRCL (CVPR 2024)☆27Jun 6, 2024Updated last year
- ☆28Dec 29, 2023Updated 2 years ago
- M-HalDetect Dataset Release☆29Nov 4, 2023Updated 2 years ago
- A tool for translating the content of LaTeX documents into various other natural languages (e.g., translating an arXiv paper from English…☆466Mar 12, 2026Updated last month
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆52Dec 21, 2023Updated 2 years ago
- CVPR 2024 - Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity☆28May 28, 2024Updated last year