Aleph-Alpha-Research/scaling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Aleph-Alpha-Research/scaling)

Aleph-Alpha-Research / scaling

Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for training large language models.

☆66

Alternatives and similar repositories for scaling

Users that are interested in scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ethansmith2000 / TransformerExperiments
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
Aleph-Alpha-Research / eval-framework
View on GitHub
Comprehensive LLM evaluation at scale: A production-ready framework for evaluating large language models across multiple benchmarks.
☆41Updated this week
samsja / pydantic_config
View on GitHub
Manage ML configuration with pydantic
☆16Mar 18, 2026Updated 4 months ago
graphcore-research / jax-scalify
View on GitHub
JAX Scalify: end-to-end scaled arithmetics
☆18Oct 30, 2024Updated last year
tugot17 / tokenomics
View on GitHub
Estimate the throughput of OAI compatible servers
☆21Jul 1, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cisco-open / pymultiworld
View on GitHub
A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL
☆20Feb 9, 2026Updated 5 months ago
google-deepmind / asyncdiloco
View on GitHub
☆51Jan 18, 2024Updated 2 years ago
dream3d-ai / torch-submit
View on GitHub
☆10Dec 21, 2024Updated last year
PrimeIntellect-ai / prime-iroh
View on GitHub
Asynchronous P2P communication backend for decentralized pipeline parallelism
☆46Updated this week
packquickly / schedule_free_optx
View on GitHub
Schedule free optimiser implemented in JAX using Optimistix
☆15May 29, 2024Updated 2 years ago
ekinakyurek / gpt3-arithmetic
View on GitHub
Scratchpad/Chain-of-Thought Prompts
☆12Jun 6, 2022Updated 4 years ago
edwardmilsom / function-space-learning-rates-paper
View on GitHub
Code for the paper "Function-Space Learning Rates"
☆23Jun 3, 2025Updated last year
eligotts / legos
View on GitHub
☆24Jan 22, 2026Updated 6 months ago
PrimeIntellect-ai / pccl
View on GitHub
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆157Sep 12, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JannikSt / ibtop
View on GitHub
Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects
☆141Dec 30, 2025Updated 6 months ago
evanatyourservice / llm-jax
View on GitHub
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Jul 24, 2025Updated last year
graphcore-research / unit-scaling
View on GitHub
A library for unit scaling in PyTorch
☆135Jul 11, 2025Updated last year
alexa / ramen
View on GitHub
A software for transferring pre-trained English models to foreign languages
☆20Mar 20, 2023Updated 3 years ago
lianakoleva / no-libtorch-compile
View on GitHub
☆21Mar 3, 2025Updated last year
bloc97 / DeMo
View on GitHub
DeMo: Decoupled Momentum Optimization
☆202Dec 2, 2024Updated last year
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
cat-state / tinypar
View on GitHub
☆20Jul 12, 2023Updated 3 years ago
Birch-san / booru-embed
View on GitHub
[WIP] Transformer to embed Danbooru labelsets
☆13Mar 31, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
PrimeIntellect-ai / OpenDiloco
View on GitHub
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
☆583Jul 17, 2026Updated last week
haizelabs / annotate
View on GitHub
Skill to annotate and create ai judges from agent logs
☆17Oct 28, 2025Updated 9 months ago
iwiwi / epochraft
View on GitHub
Checkpointable dataset utilities for foundation model training
☆32Jan 29, 2024Updated 2 years ago
lucaslingle / mu_transformer
View on GitHub
Official implementation of 'A Large-Scale Exploration of mu-Transfer' (CoRR 2024)
☆31Jun 5, 2025Updated last year
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
g4lb / proxy-service
View on GitHub
A Proxy service using FastAPI and Protocol Buffers (Proto3)
☆13Jun 17, 2023Updated 3 years ago
arcee-ai / trinity-large-tech-report
View on GitHub
☆125Feb 19, 2026Updated 5 months ago
samsja / muon_fsdp_2
View on GitHub
Muon fsdp 2
☆64Aug 8, 2025Updated 11 months ago
Aleph-Alpha-Research / NeurIPS-WANT-submission-efficient-parallelization-layouts
View on GitHub
☆22Dec 15, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HealthML / active-segmentation
View on GitHub
ActiveSegmentation: A Simulation Framework for Benchmarking Active Learning Strategies for 3D Medical Image Segmentation
☆20Jul 5, 2022Updated 4 years ago
kimbochen / md-blogs
View on GitHub
A blog where I write about research papers and blog posts I read.
☆12Nov 20, 2024Updated last year
PrimeIntellect-ai / pi-quant
View on GitHub
SIMD quantization kernels
☆93May 29, 2026Updated 2 months ago
matttreed / diloco-sim
View on GitHub
☆23Jan 5, 2025Updated last year
EleutherAI / rnngineering
View on GitHub
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33May 25, 2024Updated 2 years ago
PrimeIntellect-ai / experiments-autonomous-speedrunning
View on GitHub
autonomous nanogpt optimizer speedrun
☆109May 14, 2026Updated 2 months ago
VatsaDev / NanoPoor
View on GitHub
NanoGPT-speedrunning for the poor T4 enjoyers
☆72Apr 22, 2025Updated last year