Core communication lib for Bagua.
☆48Sep 15, 2021Updated 4 years ago
Alternatives and similar repositories for bagua-core
Users that are interested in bagua-core are comparing it to the libraries listed below
- High performance NCCL plugin for Bagua.☆15Sep 15, 2021Updated 4 years ago
- Bagua tutorials.☆13Sep 4, 2022Updated 3 years ago
- Bagua Speeds up PyTorch☆884Aug 1, 2024Updated last year
- OSPP 2022 Project: String Adaptive Hash Table for Databend☆19Sep 15, 2022Updated 3 years ago
- The codebase for DBSim☆16Mar 8, 2023Updated 2 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆62Jul 1, 2022Updated 3 years ago
- Distributed training for various deep learning (DL) frameworks, including Tensorflow, Tensorflow2, Pytorch, Chainer, Caffe, Mxnet ...☆22Aug 8, 2020Updated 5 years ago
- ☆10May 16, 2021Updated 4 years ago
- A networking framework built on top of MQTT to allow the communication and synchronization of distributed, language-independent resources…☆11Feb 28, 2024Updated 2 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end)☆12Jan 6, 2016Updated 10 years ago
- ☆15Jul 18, 2023Updated 2 years ago
- 🕹 Implementation for the lesson Compiling Engineering(2020 Spring) in Peking University, adjusted from UCLA CS 132 Project.☆10Jun 21, 2020Updated 5 years ago
- ☆26May 22, 2022Updated 3 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 2 years ago
- Learned SPatial Hashmap☆12Sep 14, 2025Updated 5 months ago
- Portable LLM - A rust library for LLM inference☆11Apr 13, 2024Updated last year
- TPCH benchmark tool for databend☆11Nov 15, 2022Updated 3 years ago
- Binary translation in Rust☆13Jun 22, 2020Updated 5 years ago
- Layer-wise Sparsification of Distributed Deep Learning☆10Jul 6, 2020Updated 5 years ago
- Paper list for acceleration of transformers☆14Jul 1, 2023Updated 2 years ago
- SIMD aligned data structures to work with `std::simd`.☆11Dec 14, 2024Updated last year
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- An Implementation for Raft Lab☆11Mar 1, 2020Updated 6 years ago
- clustering algorithm implementation☆13Nov 3, 2025Updated 4 months ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆411Jun 14, 2025Updated 8 months ago
- Experimental DataFusion Optimizer☆52Jun 9, 2023Updated 2 years ago
- Distributed ML Optimizer☆35Jul 28, 2021Updated 4 years ago
- Kubernetes operator for Bagua distributed training job.☆13Feb 7, 2023Updated 3 years ago
- A computation-parallel deep learning architecture.☆13Sep 25, 2019Updated 6 years ago
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 2 years ago
- A fast multi-threaded base64 encoding / decoding library and CLI tool, made in Rust.☆12Aug 7, 2023Updated 2 years ago
- SQL Benchmark derived from TPC-H☆11May 20, 2023Updated 2 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆34Feb 10, 2025Updated last year
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Dec 4, 2023Updated 2 years ago
- A bridge between different serde implementations.☆16Sep 8, 2025Updated 5 months ago
- ☆16Sep 4, 2023Updated 2 years ago
- A Triton-only attention backend for vLLM☆24Feb 11, 2026Updated 3 weeks ago
- Python environment for Chinese Standard Mahjong on Botzone platform.☆14Jan 18, 2021Updated 5 years ago
- Model factory is a ML training platform to help engineers to build ML models at scale☆17Sep 27, 2021Updated 4 years ago