BBuf / megatron-lm-parallel-group-playground
☆16 Mar 30, 2024 Updated last year
Alternatives and similar repositories for megatron-lm-parallel-group-playground
Users who are interested in megatron-lm-parallel-group-playground are comparing it to the libraries listed below
- ☆11 Dec 26, 2025 Updated last month
- OneFlow->ONNX ☆43 Apr 19, 2023 Updated 2 years ago
- study of cutlass ☆22 Nov 10, 2024 Updated last year
- ☆52 May 19, 2025 Updated 8 months ago
- CFR implementation of a poker bot. ☆12 Feb 17, 2023 Updated 2 years ago
- ☆23 Apr 25, 2023 Updated 2 years ago
- ☆125 Dec 15, 2023 Updated 2 years ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention. ☆32 Nov 29, 2024 Updated last year
- ☆34 Feb 3, 2025 Updated last year
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms ☆13 Dec 15, 2022 Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat… ☆13 Nov 27, 2022 Updated 3 years ago
- TridentNet in mmdetection ☆22 Apr 2, 2020 Updated 5 years ago
- Odysseus: Playground of LLM Sequence Parallelism ☆79 Jun 17, 2024 Updated last year
- Some microbenchmarks and design docs before commencement ☆12 Feb 1, 2021 Updated 5 years ago
- a size profiler for cuda binary ☆72 Jan 15, 2026 Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 Nov 14, 2025 Updated 3 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation ☆12 Feb 16, 2025 Updated last year
- Research on a time bank system based on a mutual-aid elderly care model (Cheng Cheng) ☆10 Nov 18, 2014 Updated 11 years ago
- Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs ☆929 Nov 27, 2025 Updated 2 months ago
- A more efficient yolov5 with oneflow backend 🎉🎉🎉 ☆217 Jul 10, 2025 Updated 7 months ago
- A Texas Holdem poker framework written in C++ 20. ☆11 Apr 23, 2023 Updated 2 years ago
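- 🌿 Quickly generates a folder directory tree; supports configuring the directory depth and exporting to a markdown file. ☆13 Oct 19, 2022 Updated 3 years ago
- Implementations of 18 major data mining algorithms, plus other related classic DM algorithms ☆12 Aug 2, 2015 Updated 10 years ago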
- A smartphone specs API powered with the most trusted phone information website gsm arena. ☆16 Feb 1, 2024 Updated 2 years ago
- derived from https://github.com/wilfredinni/python-cheatsheet ☆10 Nov 8, 2023 Updated 2 years ago
- Triton-based Symmetric Memory operators and examples ☆81 Jan 15, 2026 Updated last month
- ☆22 Dec 11, 2025 Updated 2 months ago
- A bot that does auto search and gains points ☆10 Nov 2, 2023 Updated 2 years ago
- The official github repo for the open online courses: "Dive into LLMs". ☆10 Mar 15, 2024 Updated last year
- AlgorithmNote is a knowledge sharing github page, mainly has three parts: algorithm, engineering and basic knowledge. ☆14 Feb 17, 2015 Updated 11 years ago
- ☆22 Dec 23, 2025 Updated last month
- 🚀 Train your own VLA "large model" end to end: a 26M-parameter multimodal vision-language model (VLM) trained from scratch in just 1 hour! 🌏 ☆26 Oct 16, 2025 Updated 4 months ago
- ☆10 Sep 26, 2024 Updated last year
- JAX implementation of LLaMA, aiming to train LLaMA on Google Cloud TPU ☆14 Jul 22, 2023 Updated 2 years ago
- NoSQLBenchmark is a stress test toolkit for testing NoSQL ☆19 Dec 30, 2011 Updated 14 years ago
- This project wraps the WeChat OCR functionality from the excellent wechat-ocr project into a simple REST API service that can be easily d… ☆14 Dec 29, 2025 Updated last month
- Stable Diffusion inference benchmarks ☆10 Jun 14, 2024 Updated last year
- Solution for the final round of the first China ECG Intelligence Challenge (public version). Competition website: http://mdi.ids.tsinghua.edu.cn/ ☆10 Aug 21, 2019 Updated 6 years ago
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo… ☆19 Oct 20, 2025 Updated 3 months ago