Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
☆97Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for gdGPT
Users that are interested in gdGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Nov 21, 2023Updated 2 years ago
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆58Jul 4, 2023Updated 2 years ago
- ☆84Sep 9, 2023Updated 2 years ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero☆11Apr 28, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Jan 10, 2023Updated 3 years ago
- ☆27Aug 31, 2023Updated 2 years ago
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Jul 9, 2024Updated last year
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆11Apr 23, 2022Updated 4 years ago
- ☆43Dec 15, 2023Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- Best practice for training LLaMA models in Megatron-LM☆665Jan 2, 2024Updated 2 years ago
- PyTorch bindings for CUTLASS grouped GEMM.☆188Apr 8, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Elixir: Train a Large Language Model on a Small GPU Cluster☆16Jun 8, 2023Updated 2 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- 工业级中文语音识别系统电子书☆13Oct 30, 2020Updated 5 years ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,440Mar 20, 2024Updated 2 years ago
- ☆26Nov 7, 2022Updated 3 years ago
- A more efficient GLM implementation!☆54Feb 18, 2023Updated 3 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆2,249Aug 14, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A gomoku AI based on Alpha Zero paper.☆12May 1, 2023Updated 3 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)☆55Feb 24, 2021Updated 5 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆30Feb 12, 2022Updated 4 years ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- ☆24Jun 24, 2020Updated 5 years ago
- Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“☆64Jun 5, 2024Updated last year
- ☆23Oct 30, 2023Updated 2 years ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆104Apr 21, 2026Updated last month
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆33Feb 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NTK scaled version of ALiBi position encoding in Transformer.☆69Aug 16, 2023Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 3 years ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆417Jun 1, 2023Updated 2 years ago
- llama2 finetuning with deepspeed and lora☆176Jul 28, 2023Updated 2 years ago
- 使用UniLM实现中文文本摘要☆43Mar 25, 2020Updated 6 years ago
- Tensorflow implementation of DCN for question answering on the Stanford Question Answering Dataset (SQuAD)☆13Dec 1, 2017Updated 8 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago