LYMDLUT / zpdbLinks
☆18Updated last year
Alternatives and similar repositories for zpdb
Users that are interested in zpdb are comparing it to the libraries listed below
Sorting:
- Python debug configuration generator for vscode☆29Updated 4 years ago
- A library for calculating the FLOPs in the forward() process based on torch.fx☆128Updated 6 months ago
- 多模态 MM +Chat 合集☆276Updated last month
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆503Updated 7 months ago
- Yet another PyTorch Trainer and some core components for deep learning.☆223Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆122Updated last year
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆42Updated 2 years ago
- 一款便捷的抢占显卡脚本☆364Updated 8 months ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models☆329Updated 7 months ago
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆191Updated 3 years ago
- Cool Papers - Immersive Paper Discovery☆627Updated last month
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆193Updated 8 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆178Updated 2 years ago
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆340Updated last year
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆272Updated last month
- ☆65Updated 3 years ago
- DeepSpeed Tutorial☆102Updated last year
- ☆43Updated 8 months ago
- A paper list of some recent works about Token Compress for Vit and VLM☆684Updated 3 weeks ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆220Updated 2 weeks ago
- Efficient Multimodal Large Language Models: A Survey☆373Updated 5 months ago
- A curated list of papers on the applications of RWKV in computer vision.☆211Updated 3 months ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆180Updated last year
- ☆197Updated last year
- Implementation of "Attention Is Off By One" by Evan Miller☆196Updated 2 years ago
- [ICML 2025 Oral] Mixture of Lookup Experts☆53Updated 4 months ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆263Updated 2 years ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆249Updated 2 years ago
- Efficient Mixture of Experts for LLM Paper List☆132Updated 2 weeks ago
- Curated list of methods that focuses on improving the efficiency of diffusion models☆45Updated last year