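A tutorial on fine-tuning large models with DeepSpeed, explaining in detail how to use DeepSpeed for fine-tuning and distributed training of a large text-summarization model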
☆15Dec 29, 2024Updated last year
Alternatives and similar repositories for DeepSpeed-Finetuning
Users that are interested in DeepSpeed-Finetuning are comparing it to the libraries listed below.
- Lightrun is a Developer-Native Observability Platform, for developers by developers☆21Sep 22, 2024Updated last year
- RNN-based IDS for SOME/IP Intrusion Detection☆10Jul 20, 2021Updated 4 years ago
- Source code for the book 《深度学习与图像识别:原理与实践》 (Deep Learning and Image Recognition: Principles and Practice)☆10Nov 1, 2021Updated 4 years ago
- Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)☆10Oct 26, 2021Updated 4 years ago
- A python script that uses Google Translate and Stable Diffusion to assist people in creating images in their native languages.☆10Dec 16, 2022Updated 3 years ago
- vulkan graphic engine for learning☆12Dec 21, 2024Updated last year
- Notes on practice coding problems in C++☆18Apr 6, 2021Updated 4 years ago
- ☆11Oct 16, 2024Updated last year
- Improved Autoencoder-based Ensemble In-vehicle Intrusion Detection System☆13Oct 3, 2023Updated 2 years ago
- Team of ChongShi Perception Source Code☆14Aug 26, 2020Updated 5 years ago
- A PyTorch implementation of MatchPyramid "Text Matching as Image Recognition"☆11Jul 25, 2024Updated last year
- Neural network enhanced global illumination with Unity☆11Apr 12, 2024Updated last year
- ☆20May 21, 2025Updated 10 months ago
- ☆16Apr 14, 2024Updated last year
- DeepSearch - Advanced Web Dir Scanner☆14Nov 13, 2018Updated 7 years ago
- ☆10Aug 29, 2022Updated 3 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆13Jul 27, 2024Updated last year
- ☆45Oct 5, 2025Updated 5 months ago
- Implements some custom schedulers based on the Kubernetes scheduler framework (scheduler plugins built on the k8s scheduling framework, used to extend scheduling logic)☆19Nov 30, 2022Updated 3 years ago
- Python sandboxes for LLMs☆37Dec 19, 2025Updated 3 months ago
- ☆12Jun 14, 2022Updated 3 years ago
- This project uses Keras and ALBERT to implement a multi-class text classification task, fine-tuning ALBERT in the process.☆17Jan 5, 2021Updated 5 years ago
- Machine Learning Generative Classical Music using RNN LSTMs with MIDI music dataset and Magenta Tensorflow☆23Jan 2, 2021Updated 5 years ago
- A traffic sign recognition project based on yolov5, using the tt100k dataset☆27Dec 29, 2023Updated 2 years ago
- Old Photo Restoration☆22May 25, 2025Updated 9 months ago
- Package for DIGIT tactile sensor☆35Apr 8, 2024Updated last year
- Some resources for 保研 (exam-exempt recommendation for graduate admission)☆21Apr 14, 2019Updated 6 years ago
- A Transformer-based GAN for Anomaly Detection, International Conference on Artificial Neural Networks, (ICANN2022).☆23Mar 7, 2023Updated 3 years ago
- Evaluating the Long-Term Memory of Large Language Models☆60Feb 6, 2026Updated last month
- ☆49Oct 26, 2025Updated 4 months ago
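- Crawler for the profile data of 300 million Bilibili users (mid, nickname, gender, following, followers, level)☆26Apr 19, 2018Updated 7 years ago
- Lab M1 of JYY's operating systems course at Nanjing University: implement pstree to print the tree of parent-child relationships between processes, covering the default mode and the -V, -p, and -n options; combined options are not required☆17Aug 9, 2022Updated 3 years ago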
- ☆32Sep 28, 2020Updated 5 years ago
- Advanced Retrieval Algorithms for Decomposing Large-Scale Candidate Set into Pieces.☆74Apr 13, 2025Updated 11 months ago
- jPBC fork to get Maven working again☆17Feb 25, 2019Updated 7 years ago
- Important research papers, blogposts, repos on tactile sensing (vision-based)☆49Feb 15, 2023Updated 3 years ago
- A modular AI agent framework built with FastAPI (backend) and TypeScript (frontend), integrating LangChain/LangGraph-based agents for con…☆78Mar 11, 2026Updated last week
- Practical Network-Wide Configuration Synthesis with Autocompletion☆41Sep 18, 2025Updated 6 months ago
- qTESLA Library, an optimized implementation of the post-quantum lattice-based digital signature scheme qTESLA.☆26Aug 31, 2022Updated 3 years ago