LYMDLUT / zpdb
☆18 Updated last year
Alternatives and similar repositories for zpdb
Users who are interested in zpdb are comparing it to the repositories listed below
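- Python debug configuration generator for VS Code ☆29 Updated 4 years ago
- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across methods ☆126 Updated last year
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore ☆43 Updated 2 years ago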
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures ☆523 Updated 9 months ago
- DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models) ☆183 Updated 2 years ago
- ☆42 Updated 10 months ago
- Multimodal (MM) + Chat collection ☆279 Updated 3 months ago
- A library for calculating the FLOPs in the forward() process based on torch.fx ☆132 Updated 7 months ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models ☆334 Updated 9 months ago
- Yet another PyTorch Trainer and some core components for deep learning. ☆223 Updated last year
- A convenient script for grabbing GPUs ☆380 Updated 10 months ago
- ☆65 Updated 3 years ago
- ☆199 Updated last year
- PaddlePaddle Code Convert Toolkit, a deep learning code conversion tool for PaddlePaddle (飞桨) ☆118 Updated this week
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" ☆224 Updated 2 months ago
- A curated list of papers on the applications of RWKV in computer vision. ☆223 Updated 5 months ago
- ☆207 Updated last month
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources ☆285 Updated 3 months ago
- Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in C… ☆391 Updated last month
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from … ☆184 Updated last year
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution" ☆266 Updated last month
- ☆214 Updated last year
- Efficient Mixture of Experts for LLM Paper List ☆144 Updated 2 months ago
- ☆93 Updated 2 years ago
- Cool Papers - Immersive Paper Discovery ☆653 Updated 3 months ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks. ☆237 Updated 6 months ago
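- PaddlePaddle (飞桨) Escort Program training camp ☆20 Updated last month
- ☆220 Updated 9 months ago
- An awesome GPU task scheduler. A lightweight, easy-to-use task scheduling tool for GPU clusters. If you find it useful, give it a star. ☆193 Updated 3 years ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks ☆390 Updated last year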