基于DeepSpeed的大模型微调教程,详细介绍如何使用DeepSpeed进行微调和分布式训练文本总结大模型
☆16Dec 29, 2024Updated last year
Alternatives and similar repositories for DeepSpeed-Finetuning
Users that are interested in DeepSpeed-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightrun is a Developer-Native Observability Platform, for developers by developers☆22Sep 22, 2024Updated last year
- RNN-based IDS for SOME/IP Intrusion Detection☆10Jul 20, 2021Updated 4 years ago
- 《深度学习与图像识别:原理与实践》图书源码☆11Nov 1, 2021Updated 4 years ago
- Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)☆10Oct 26, 2021Updated 4 years ago
- A python script that uses Google Translate and Stable Diffusion to assist people in creating images in their native languages.☆10Dec 16, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- vulkan graphic engine for learning☆12Dec 21, 2024Updated last year
- 刷题笔记C++☆17Apr 6, 2021Updated 5 years ago
- Improved Autoencoder-based Ensemble In-vehicle Intrusion Detection System☆13Oct 3, 2023Updated 2 years ago
- ☆11Oct 16, 2024Updated last year
- Team of ChongShi Perception Source Code☆14Aug 26, 2020Updated 5 years ago
- An pytorch implementation of MatchPyramid "Text Matching as Image Recognition"☆11Jul 25, 2024Updated last year
- nerual network enhanced global ilumination with Unity☆11Apr 12, 2024Updated 2 years ago
- [EMNLP 2024 Oral] PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling☆22Apr 21, 2025Updated 11 months ago
- ☆20May 21, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Apr 14, 2024Updated last year
- 基于腾讯文档的自动填写助手☆17Nov 14, 2020Updated 5 years ago
- ☆10Aug 29, 2022Updated 3 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆13Jul 27, 2024Updated last year
- DeepSearch - Advanced Web Dir Scanner☆14Nov 13, 2018Updated 7 years ago
- implement some custom schedulers based on kubernetes scheduler framework(基于k8s调度框架实现的调度器插件,用于扩展调度逻辑)☆19Nov 30, 2022Updated 3 years ago
- ☆48Oct 5, 2025Updated 6 months ago
- Python sandboxes for llms☆38Dec 19, 2025Updated 3 months ago
- From Pattern Recognizers to Personalized Companions: A Survey of Large Language Models in Mental Health☆83Apr 2, 2026Updated last week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Jun 14, 2022Updated 3 years ago
- 本项目采用Keras和ALBERT实现文本多分类任务,其中对ALBERT进行微调。☆17Jan 5, 2021Updated 5 years ago
- Machine Learning Generative Classical Music using RNN LSTMs with MIDI music dataset and Magenta Tensorflow☆23Jan 2, 2021Updated 5 years ago
- 基于yolov5的交通标志牌识别项目,使用tt100k数据集☆29Dec 29, 2023Updated 2 years ago
- Package for DIGIT tactile sensor☆35Apr 8, 2024Updated 2 years ago
- 一些保研资源☆21Apr 14, 2019Updated 6 years ago
- Old Photo Restoration☆24May 25, 2025Updated 10 months ago
- A Transformer-based GAN for Anomaly Detection, International Conference on Artificial Neural Networks, (ICANN2022).☆23Mar 7, 2023Updated 3 years ago
- Evaluating the Long-Term Memory of Large Language Models☆60Feb 6, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆49Updated this week
- B站3亿用户信息爬虫(mid号,昵称,性别,关注,粉丝,等级)☆27Apr 19, 2018Updated 7 years ago
- ☆32Sep 28, 2020Updated 5 years ago
- 南大JYY操作系统课实验M1:实现 pstree 打印进程之间的树状的父子关系,包括默认,-V,-p,-n三种选项,不要求组合选项☆17Aug 9, 2022Updated 3 years ago
- Advanced Retrieval Algorithms for Decomposing Large-Scale Candidate Set into Pieces.☆73Apr 13, 2025Updated last year
- jPBC fork for get Maven works again☆17Feb 25, 2019Updated 7 years ago
- A curated collection of influential surveys and papers on Retrieval-Augmented Generation (RAG), covering frameworks, evaluations, multi-m…☆50Feb 3, 2025Updated last year