sail-sg/VocabularyParallelism

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sail-sg/VocabularyParallelism)

sail-sg / VocabularyParallelism

Vocabulary Parallelism

☆26

Alternatives and similar repositories for VocabularyParallelism

Users that are interested in VocabularyParallelism are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sail-sg / odc
View on GitHub
On demand communication
☆34Apr 16, 2026Updated 3 months ago
thunlp / Seq1F1B
View on GitHub
Sequence-level 1F1B schedule for LLMs.
☆37Aug 26, 2025Updated 10 months ago
sail-sg / zero-bubble-pipeline-parallelism
View on GitHub
Zero Bubble Pipeline Parallelism
☆462May 7, 2025Updated last year
yzhwang / jax-multi-gpu-resnet50-example
View on GitHub
An example showing how to use jax to train resnet50 on multi-node multi-GPU
☆20Jul 4, 2022Updated 4 years ago
MayDomine / Seq1F1B
View on GitHub
Sequence-level 1F1B schedule for LLMs.
☆19Jun 4, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
MingjiHan99 / KVRaft
View on GitHub
MIT 6.824 2020
☆10Mar 31, 2021Updated 5 years ago
lilakk / BLEUBERI
View on GitHub
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆32Jun 5, 2025Updated last year
yuanxinnn / APTMoE
View on GitHub
☆13Jun 29, 2024Updated 2 years ago
amazon-science / piperag
View on GitHub
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)
☆32Jun 14, 2024Updated 2 years ago
fanshiqing / grouped_gemm
View on GitHub
PyTorch bindings for CUTLASS grouped GEMM.
☆191Apr 8, 2026Updated 3 months ago
shengshu-ai / TurboServe
View on GitHub
TurboServe: Serving Streaming Video Generation Efficiently and Economically
☆34Jul 12, 2026Updated last week
sail-sg / hloenv
View on GitHub
an environment based on XLA for deep learning compiler optimization research.
☆24Mar 7, 2023Updated 3 years ago
chenweiphd / OpenEDA-ChipGPT-Hub
View on GitHub
☆12Jun 22, 2023Updated 3 years ago
harvard-cns / Harvard-CNS-Seminar
View on GitHub
Reading seminar in Harvard Cloud Networking and Systems Group
☆16Aug 29, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ikuokuo / start-scaled-yolov4
View on GitHub
Start Scaled YOLOv4
☆10Jan 9, 2021Updated 5 years ago
BatsResearch / efsl
View on GitHub
Extended Few-Shot Learning: Exploiting Existing Resources for Novel Tasks
☆10Jul 6, 2021Updated 5 years ago
ChuangtaoChen-TUM / LiveMind
View on GitHub
☆15Apr 15, 2026Updated 3 months ago
chenyu-jiang / dcp
View on GitHub
Code repository for the SOSP'25 paper DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
☆21Nov 28, 2025Updated 7 months ago
AniZpZ / smoothquant
View on GitHub
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆11Dec 13, 2023Updated 2 years ago
chengzeyi / piflux
View on GitHub
(WIP) Parallel inference for black-forest-labs' FLUX model.
☆19Nov 18, 2024Updated last year
ysyisyourbrother / Galaxy-LM
View on GitHub
Work in progress LLM framework.
☆16Oct 31, 2024Updated last year
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
IANNXANG / RuscaRL
View on GitHub
☆48Jan 30, 2026Updated 5 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
linxihui / dkernel
View on GitHub
☆22Apr 17, 2025Updated last year
ars22 / e3
View on GitHub
☆20Sep 16, 2025Updated 10 months ago
Froot-NetSys / Arya
View on GitHub
Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling
☆18Sep 27, 2023Updated 2 years ago
XenoZLH / Shuffle-R1
View on GitHub
Official code repository of Shuffle-R1
☆26Feb 23, 2026Updated 4 months ago
hatsu3 / curator
View on GitHub
☆13Jan 17, 2024Updated 2 years ago
icerain-alt / FSDPToys
View on GitHub
Learning and Debugging for FSDP/FSDP2 Training
☆17Feb 7, 2026Updated 5 months ago
M1n9X / GraphRAG_Lite
View on GitHub
☆16Jul 12, 2024Updated 2 years ago
viswavi / RLCF
View on GitHub
☆24Oct 23, 2025Updated 8 months ago
RulinShao / LightSeq
View on GitHub
Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training
☆223Aug 19, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
gofreelee / SpaceServe
View on GitHub
☆31Jul 13, 2026Updated last week
DBOS-project / dbos-scheduler
View on GitHub
☆28Jun 1, 2021Updated 5 years ago
MozerWang / DEMO
View on GitHub
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22Dec 16, 2024Updated last year
aws-samples / awsame-distributed-ai-samples
View on GitHub
Cluster doctor skills
☆14May 23, 2026Updated last month
DerrickYLJ / LessIsMore
View on GitHub
[ICML 2026] Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
☆34Sep 12, 2025Updated 10 months ago
YoujiaZhang / USD
View on GitHub
[IJCAI 2025] Optimized View and Geometry Distillation from Multi-view Diffuser
☆18May 2, 2025Updated last year
uw-mad-dash / decoding-speculative-decoding
View on GitHub
☆16Aug 19, 2024Updated last year