axonn-ai / axonnView external linksLinks
Parallel framework for training and fine-tuning deep neural networks
☆70Nov 10, 2025Updated 3 months ago
Alternatives and similar repositories for axonn
Users that are interested in axonn are comparing it to the libraries listed below
Sorting:
- Analyze parallel execution traces using pandas dataframes☆24Oct 22, 2025Updated 3 months ago
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks☆49Feb 25, 2025Updated 11 months ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 10 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆77Apr 3, 2024Updated last year
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 5 months ago
- ☆14Sep 7, 2023Updated 2 years ago
- ☆14Mar 8, 2025Updated 11 months ago
- Aries Network Performance Counters Monitoring Library☆11Nov 19, 2020Updated 5 years ago
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated 10 months ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Distributed Deep Learning Tutorial☆16Nov 17, 2025Updated 2 months ago
- Damselfly Network Simulator☆10Nov 19, 2020Updated 5 years ago
- A PyTorch native platform for training generative AI models☆15Nov 18, 2025Updated 2 months ago
- ☆18Nov 11, 2025Updated 3 months ago
- ☆14Mar 1, 2025Updated 11 months ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆16Apr 18, 2025Updated 9 months ago
- ☆42Jan 24, 2026Updated 3 weeks ago
- MemLiner is a remote-memory-friendly runtime system.☆31Nov 1, 2022Updated 3 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆34Jan 30, 2026Updated 2 weeks ago
- What do we learn from inverting CLIP models?☆58Mar 6, 2024Updated last year
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- Guidelines on using Weights and Biases logging for deep learning applications on NERSC machines☆13Aug 7, 2023Updated 2 years ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Jul 9, 2025Updated 7 months ago
- ☆17Nov 11, 2025Updated 3 months ago
- A suite of communication proxies for HPC applications☆13Jul 7, 2023Updated 2 years ago
- Repo for a DOE HPC workflow training event☆13Apr 28, 2023Updated 2 years ago
- some mixture of experts architecture implementations☆25Mar 22, 2024Updated last year
- Parallel Code Evaluation Benchmark☆42Nov 4, 2025Updated 3 months ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆476Feb 3, 2026Updated 2 weeks ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Jul 13, 2025Updated 7 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Updated this week
- Unit Scaling demo and experimentation code☆16Mar 12, 2024Updated last year
- Official implementation of GOAT model (ICML2023)☆38Jul 3, 2023Updated 2 years ago
- parser script to process pytorch autograd profiler result, convert json file to excel.☆14Oct 8, 2019Updated 6 years ago
- scalable data movement in Exascale Supercomputers☆17Dec 4, 2025Updated 2 months ago
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated last year
- Benchmarks☆17Apr 28, 2025Updated 9 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Jun 18, 2024Updated last year