This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆21Jul 2, 2024Updated last year
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated last year
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models"☆17Mar 11, 2025Updated last year
- ☆10Sep 18, 2021Updated 4 years ago
- ☆16Aug 20, 2021Updated 4 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆78Dec 8, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”☆16Jan 8, 2024Updated 2 years ago
- [NeurIPS 2024] Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method☆15Oct 1, 2024Updated last year
- The official implantation of SGPT (CVPR2024)☆17Jul 15, 2024Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆42Mar 31, 2025Updated last year
- VHTest☆16Oct 31, 2024Updated last year
- Implementation of FedBary☆16Mar 24, 2025Updated last year
- Simple implementation of Retrieval-Augmented Generation System☆29Oct 24, 2024Updated last year
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆23Aug 14, 2024Updated last year
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning☆20Dec 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- [TMLR23] FedDAG: Federated DAG Structure Learning☆19Jan 7, 2023Updated 3 years ago
- This is the formal code implementation of the CVPR 2024 paper 'Traceable Federated Continual Learning'.☆18May 31, 2024Updated last year
- ☆56Nov 12, 2024Updated last year
- VDPG: Adapting to Distribution Shift by Visual Domain Prompt Generation (ICLR 2024)☆18Jun 10, 2025Updated 10 months ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated 11 months ago
- Forked from *OneIE: A Joint Neural Model for Information Extraction with Global Features*☆21Sep 4, 2022Updated 3 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆35Jul 23, 2024Updated last year
- CVPR 2024 - Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity☆28May 28, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Official repository for Online Class Incremental Learning on Stochastic Blurry Task Boundary via Mask and Visual Prompt Tuning on ICCV 20…☆29Oct 26, 2024Updated last year
- ☆62Jul 12, 2025Updated 9 months ago
- Code for NAACL 2022 paper (Main Track) "RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Ex…☆36Aug 2, 2022Updated 3 years ago
- [CVPR2024] FedHCA^2: Towards Hetero-Client Federated Multi-Task Learning☆35Feb 18, 2025Updated last year
- [CVPR 2024] Official Repository for "FedSelect: Personalized Federated Learning with Customized Selection of Parameters for Fine-Tuning"☆35Nov 4, 2024Updated last year
- The Official Repository for CVPR2023 Paper "NICO++: Towards Better Benchmarking for Domain Generalization".☆42Jul 29, 2023Updated 2 years ago
- (CVPR 2024) Communication-Efficient Federated Learning with Accelerated Client Gradient☆42Aug 15, 2025Updated 7 months ago
- ☆38Dec 5, 2025Updated 4 months ago
- Concurrency library☆17Oct 13, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Improving AMR parsing with Sequence-to-Sequence Pre-training☆42Oct 23, 2020Updated 5 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- A flat container abstraction for Rust☆16Nov 24, 2025Updated 4 months ago
- ☆41Feb 7, 2024Updated 2 years ago
- Interactive, GPU accelerated computation graphs☆12Nov 21, 2024Updated last year
- ☆12May 2, 2022Updated 3 years ago