This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆21Jul 2, 2024Updated 2 years ago
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆122Jun 18, 2025Updated last year
- Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation☆30Jun 30, 2025Updated last year
- Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration".☆14Nov 18, 2024Updated last year
- The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆167May 15, 2026Updated last month
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the Internship at NEU-NLP☆21Apr 18, 2023Updated 3 years ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆13May 1, 2024Updated 2 years ago
- Problems and Results of IWLS 2023 Programming Contest☆17Apr 12, 2025Updated last year
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models"☆17Mar 11, 2025Updated last year
- ☆11Aug 15, 2023Updated 2 years ago
- 🔥 A Survey on AI Auto-Research☆406Updated this week
- (CVPR 2024) Uniformity and Variance for Heterogeneous Federated Learning☆12Mar 6, 2024Updated 2 years ago
- ☆12Jul 4, 2024Updated last year
- ☆16Aug 25, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Aug 20, 2021Updated 4 years ago
- Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24.☆14Feb 9, 2025Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆84Dec 8, 2025Updated 6 months ago
- [NeurIPS 2024] Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method☆15Oct 1, 2024Updated last year
- VHTest☆16Oct 31, 2024Updated last year
- Implementation of FedBary☆17Mar 24, 2025Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆44Mar 31, 2025Updated last year
- 阿里天池智慧交通预测挑战赛-Top7 /1716队☆15Jun 9, 2018Updated 8 years ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆25Aug 14, 2024Updated last year
- Official repository for FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning (ICCV2023)☆20Dec 1, 2023Updated 2 years ago
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning☆22Dec 26, 2024Updated last year
- UniGen approximately uniform sampler☆38Jul 24, 2025Updated 11 months ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated 2 years ago
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL☆114Updated this week
- [TMLR23] FedDAG: Federated DAG Structure Learning☆19Jan 7, 2023Updated 3 years ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- This is the formal code implementation of the CVPR 2024 paper 'Traceable Federated Continual Learning'.☆19May 31, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆31Aug 9, 2023Updated 2 years ago
- ☆56Nov 12, 2024Updated last year
- Code for "Graph Contrastive Learning with Cohesive Subgraph Awareness"☆20Feb 29, 2024Updated 2 years ago
- VDPG: Adapting to Distribution Shift by Visual Domain Prompt Generation (ICLR 2024)☆20Jun 10, 2025Updated last year
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated last year
- The official code for the paper 'Towards Fair Graph Federated Learning via Incentive Mechanisms'☆18May 23, 2024Updated 2 years ago
- 此项目为论文《FedServing: A Federated Prediction Serving Framework Based on Incentive Mechanism》的验证项目。基于 intel SGX ,实现将各个不同模型的推测结果在可信硬件中使用 truth…☆18Oct 23, 2023Updated 2 years ago