megvii-research / megfile

Megvii FILE Library - Working with Files in Python same as the standard library

☆163

Alternatives and similar repositories for megfile

Users who are interested in megfile are comparing it to the libraries listed below.

Tele-AI / TeleTron
To pioneer training long-context multi-modal transformer models
☆61 Updated 3 months ago
TencentARC / mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
☆94 Updated last year
HFAiLab / ffrecord
FireFlyer Record file format, writer and reader for DL training samples.
☆236 Updated 2 years ago
SandAI-org / MagiAttention
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
☆562 Updated last week
stepfun-ai / Step3
☆439 Updated 3 months ago
ppwwyyxx / RAM-multiprocess-dataloader
Demystify RAM Usage in Multi-Process Data Loaders
☆204 Updated 2 years ago
silverbulletmdc / showdata
Large-scale image dataset visualization tool.
☆121 Updated last week
ByteDance-Seed / VeOmni
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
☆1,338 Updated this week
xdit-project / DistVAE
A parallel VAE that avoids OOM for high-resolution image generation
☆83 Updated 3 months ago
intelligent-machine-learning / atorch
An industrial extension library of pytorch to accelerate large scale model training
☆52 Updated 3 months ago
mit-han-lab / patch_conv
Patch convolution to avoid large GPU memory usage of Conv2D
☆93 Updated 10 months ago
vipshop / cache-dit
🤗 A PyTorch-native Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.
☆588 Updated this week
FateScript / dotfiles
Useful dotfiles, including vim, zsh, tmux and vscode
☆18 Updated 2 months ago
feifeibear / long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
☆605 Updated last month
megvii-research / hpman
A hyperparameter manager for deep learning experiments.
☆96 Updated 3 years ago
Tencent-Hunyuan / flex-block-attn
flex-block-attn: an efficient block sparse attention computation library
☆88 Updated this week
RiseAI-Sys / DAX
High performance inference engine for diffusion models
☆95 Updated 2 months ago
Oneflow-Inc / vision
Datasets, Transforms and Models specific to Computer Vision
☆90 Updated 2 years ago
NVIDIA / Megatron-Energon
Megatron's multi-modal data loader
☆278 Updated last week
DeepLink-org / CVFusion
CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.
☆32 Updated 3 years ago
Vchitect / LiteGen
A light-weight and high-efficient training framework for accelerating diffusion tasks.
☆50 Updated last year
shawnricecake / draft-attention
Code for Draft Attention
☆93 Updated 6 months ago
Ascend / pytorch
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
☆458 Updated last week
octoml / deformable-attention-kernel
TVMScript kernel for deformable attention
☆25 Updated 3 years ago
thu-nics / DiTFastAttn
☆187 Updated 10 months ago
mit-han-lab / Block-Sparse-Attention
A sparse attention kernel supporting mix sparse patterns
☆385 Updated 9 months ago
ByteDance-Seed / ByteCheckpoint
ByteCheckpoint: An Unified Checkpointing Library for LFMs
☆252 Updated 4 months ago
NVlabs / COAT
[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training
☆250 Updated 3 months ago
MegEngine / mgeconvert
Converter from MegEngine to other frameworks
☆70 Updated 2 years ago
cli99 / flops-profiler
pytorch-profiler
☆51 Updated 2 years ago