This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
☆21Jul 2, 2024Updated last year
Alternatives and similar repositories for DeepSpeed-Chat-Extension
Users that are interested in DeepSpeed-Chat-Extension are comparing it to the libraries listed below.
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆122Jun 18, 2025Updated 11 months ago
- A repository used to organize content related to Large Speech (Audio) Models, including papers, data, applications, tools, and so on.☆28Nov 8, 2025Updated 7 months ago
- Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration".☆14Nov 18, 2024Updated last year
- The official repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆165May 15, 2026Updated 3 weeks ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Code for the Internship at NEU-NLP☆21Apr 18, 2023Updated 3 years ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated 2 years ago
- The official implementation of paper "TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models"☆17Mar 11, 2025Updated last year
- 🔥 A Survey on AI Auto-Research☆358May 19, 2026Updated 3 weeks ago
- ☆17Aug 20, 2021Updated 4 years ago
- Code for "Enhancing Cooperation through Selective Interaction and Long-term Experiences in Multi-Agent Reinforcement Learning", IJCAI24.☆14Feb 9, 2025Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆80Dec 8, 2025Updated 6 months ago
- Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”☆16Jan 8, 2024Updated 2 years ago
- The official implementation of SGPT (CVPR2024)☆18Jul 15, 2024Updated last year
- Northeastern University C++/C course project: a library management system (the most hardcore version)☆10May 15, 2022Updated 4 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆44Mar 31, 2025Updated last year
- A novel algorithm that distributes training-time model rewards to incentivize client contributions for federated learning. (ICLR-2024)☆18May 6, 2024Updated 2 years ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 2 months ago
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>.☆24Jan 17, 2022Updated 4 years ago
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆25Aug 14, 2024Updated last year
- Official repository for FedPerfix: Towards Partial Model Personalization of Vision Transformers in Federated Learning (ICCV2023)☆20Dec 1, 2023Updated 2 years ago
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning☆21Dec 26, 2024Updated last year
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- Frequently tested algorithm templates, prepared for next year's Lanqiao Cup individual contest☆16Nov 30, 2019Updated 6 years ago
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL☆112Feb 5, 2026Updated 4 months ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation☆58Feb 2, 2026Updated 4 months ago
- ☆56Nov 12, 2024Updated last year
- Code for "Graph Contrastive Learning with Cohesive Subgraph Awareness"☆20Feb 29, 2024Updated 2 years ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆68Apr 6, 2026Updated 2 months ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated last year
- Scrapes answers from Rain Classroom (雨课堂)☆16Nov 21, 2024Updated last year
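- A complete front-end and back-end implementation for a simple C-language grammar; compilation produces target code supported by the 8086 instruction set☆16Dec 29, 2016Updated 9 years ago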
- The official code for the paper 'Towards Fair Graph Federated Learning via Incentive Mechanisms'☆18May 23, 2024Updated 2 years ago
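- This is the validation project for the paper "FedServing: A Federated Prediction Serving Framework Based on Incentive Mechanism". Built on Intel SGX, it takes the prediction results of the different models and, inside trusted hardware, uses truth…☆18Oct 23, 2023Updated 2 years ago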
- A list of conferences and journals relevant to machine translation☆33Mar 17, 2022Updated 4 years ago
- A crawler for Rain Classroom (雨课堂) quiz questions☆20May 31, 2020Updated 6 years ago
- [CVPR2024] Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts☆31Jun 18, 2024Updated last year
- Official Implementation of FedRCL (CVPR 2024)☆27Jun 6, 2024Updated 2 years ago