xrsrke/pipegoose

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xrsrke/pipegoose)

xrsrke / pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

☆87

Alternatives and similar repositories for pipegoose

Users that are interested in pipegoose are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gyunggyung / LiOnConnect
View on GitHub
"Learning-based One-line intelligence Owner Network Connectivity Tool"
☆15Apr 19, 2023Updated 3 years ago
google-deepmind / asyncdiloco
View on GitHub
☆51Jan 18, 2024Updated 2 years ago
samsja / pydantic_config
View on GitHub
Manage ML configuration with pydantic
☆16Mar 18, 2026Updated 4 months ago
sgugger / torchdynamo-tests
View on GitHub
☆20Nov 23, 2022Updated 3 years ago
dmarx / the-rest-of-the-fucking-owl
View on GitHub
Trigger an LLM in your CI/CD to auto-complete your work
☆11Apr 5, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
srush / triton-autodiff
View on GitHub
Experiment of using Tangent to autodiff triton
☆82Jan 22, 2024Updated 2 years ago
alvarobartt / vertex-ai-huggingface-inference-toolkit
View on GitHub
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17Mar 20, 2024Updated 2 years ago
xrsrke / instructGOOSE
View on GitHub
Implementation of Reinforcement Learning from Human Feedback (RLHF)
☆172Apr 7, 2023Updated 3 years ago
pytorch / PiPPy
View on GitHub
Pipeline Parallelism for PyTorch
☆786Aug 21, 2024Updated last year
PrimeIntellect-ai / smart-contracts
View on GitHub
Solidity contracts for the decentralized Prime Network protocol
☆26Jul 6, 2025Updated last year
jason9693 / polyglot-finetuning-oslo
View on GitHub
☆19Sep 20, 2022Updated 3 years ago
Curt-Park / echo-grpc-triton
View on GitHub
Inference API server with echo and gRPC to triton server (golang)
☆13Nov 16, 2022Updated 3 years ago
tripos-education / maths-tripos-questions
View on GitHub
Archive of questions from the Cambridge Mathematics Tripos
☆10Jun 6, 2022Updated 4 years ago
jason9693 / oslo-kogpt-finetunig
View on GitHub
kogpt를 oslo로 파인튜닝하는 예제.
☆23Aug 26, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,760May 26, 2026Updated last month
tlkh / t2t-tuner
View on GitHub
Convenient Text-to-Text Training for Transformers
☆18Dec 10, 2021Updated 4 years ago
zhuzilin / ring-flash-attention
View on GitHub
Ring attention implementation with flash attention
☆1,036Sep 10, 2025Updated 10 months ago
nlothian / m1_huggingface_diffusers_demo
View on GitHub
Demo of how to get HuggingFace Diffusers working on an M1 Mac
☆15Sep 9, 2022Updated 3 years ago
teknium1 / ShareGPT-Builder
View on GitHub
☆126Dec 18, 2024Updated last year
InflectionAI / Inflection-Benchmarks
View on GitHub
Public Inflection Benchmarks
☆67Mar 6, 2024Updated 2 years ago
jason9693 / ETA4LLMs
View on GitHub
Calculating Expected Time for training LLM.
☆39Apr 17, 2023Updated 3 years ago
cisco-open / pymultiworld
View on GitHub
A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL
☆20Feb 9, 2026Updated 5 months ago
nbroad1881 / strideformer
View on GitHub
Using short models to classify long texts
☆21Mar 8, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
fal-ai-community / llmdifftracker
View on GitHub
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆32Feb 27, 2025Updated last year
daemyung / practice-triton
View on GitHub
삼각형의 실전! Triton
☆16Feb 15, 2024Updated 2 years ago
eth-easl / fmengine
View on GitHub
Utilities for Training Very Large Models
☆58Sep 25, 2024Updated last year
xrsrke / toolformer
View on GitHub
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
☆146Apr 5, 2023Updated 3 years ago
yacineMTB / llama.cpp
View on GitHub
Port of Facebook's LLaMA model in C/C++
☆16Jul 3, 2023Updated 3 years ago
graphcore-research / pytorch-tensor-tracker
View on GitHub
Flexibly track outputs and grad-outputs of torch.nn.Module.
☆13Oct 6, 2023Updated 2 years ago
dominiquegarmier / grok-pytorch
View on GitHub
pytorch implementation of grok
☆11Jul 13, 2026Updated last week
guillaume-be / SentencePiece-Rust-example
View on GitHub
Supporting example for "A Rust SentencePiece implementation"
☆20Jun 7, 2020Updated 6 years ago
SalesforceAIResearch / LaTRO
View on GitHub
☆127Jun 2, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Taishi-N324 / Awesome-RL-Reasoning
View on GitHub
Awesome-RL-Reasoning
☆16May 31, 2026Updated last month
yvrjsharma / JAX
View on GitHub
☆13Jan 3, 2022Updated 4 years ago
redwoodresearch / rust_circuit_public
View on GitHub
☆67Feb 16, 2023Updated 3 years ago
NousResearch / wandb-rs
View on GitHub
☆16Jun 25, 2026Updated 3 weeks ago
eligotts / legos
View on GitHub
☆24Jan 22, 2026Updated 5 months ago
Jokeren / triton-samples
View on GitHub
☆29Jan 17, 2025Updated last year
nateraw / spaces-docker-templates
View on GitHub
🚀🤗 A collection of templates for Hugging Face Spaces
☆35Oct 9, 2023Updated 2 years ago