megvii-research / megfile
Megvii FILE Library - Working with Files in Python same as the standard library
☆127Updated this week
Related projects ⓘ
Alternatives and complementary repositories for megfile
- FireFlyer Record file format, writer and reader for DL training samples.☆116Updated last year
- MegEngine到其他框架的转换器☆67Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆82Updated 11 months ago
- A hyperparameter manager for deep learning experiments.☆95Updated 2 years ago
- 📖A small curated list of Awesome SD/DiT/ViT/Diffusion Inference with Distributed/Caching/Sampling: DistriFusion, PipeFusion, AsyncDiff, …☆89Updated 2 months ago
- Demystify RAM Usage in Multi-Process Data Loaders☆179Updated last year
- A communication library for deep learning☆50Updated 3 months ago
- TVMScript kernel for deformable attention☆24Updated 2 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆457Updated 7 months ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆32Updated last year
- ☆74Updated 10 months ago
- A model compression and acceleration toolbox based on pytorch.☆327Updated 10 months ago
- Simple Dynamic Batching Inference☆145Updated 2 years ago
- xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism☆677Updated this week
- NART = NART is not A RunTime, a deep learning inference framework.☆38Updated last year
- mllm-npu: training multimodal large language models on Ascend NPUs☆83Updated 2 months ago
- useful dotfiles included vim, zsh, tmux and vscode☆17Updated 2 weeks ago
- A collection of memory efficient attention operators implemented in the Triton language.☆217Updated 5 months ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆26Updated 2 years ago
- Decode JPEG image on GPU using PyTorch☆84Updated last year
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch☆254Updated this week
- Patch convolution to avoid large GPU memory usage of Conv2D☆79Updated 5 months ago
- A high-performance, extensible Python AOT compiler.☆412Updated last year
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆264Updated last year
- Ring attention implementation with flash attention☆578Updated this week
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆390Updated this week
- Large scale image dataset visiualization tool.☆116Updated last year
- CUDA Templates for Linear Algebra Subroutines☆91Updated 6 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆588Updated last week
- Serving Inside Pytorch☆142Updated last week