LYMDLUT / zpdb
☆18Updated last year
Alternatives and similar repositories for zpdb
Users who are interested in zpdb are comparing it to the repositories listed below
- A library for calculating the FLOPs in the forward() process based on torch.fx☆137Updated last month
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models☆339Updated 11 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆540Updated 11 months ago
- Python debug configuration generator for vscode☆29Updated 4 years ago
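- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods☆129Updated last year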
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆45Updated 3 years ago
- DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)☆186Updated 2 years ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆204Updated 11 months ago
- Multimodal MM + Chat collection☆282Updated 5 months ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆265Updated 2 years ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆250Updated 2 years ago
- Implementation of "Attention Is Off By One" by Evan Miller☆198Updated 2 years ago
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆141Updated 2 years ago
- ☆61Updated last year
- ☆201Updated 2 years ago
- Efficient Mixture of Experts for LLM Paper List☆166Updated 4 months ago
- A convenient script for grabbing GPUs☆393Updated last month
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆305Updated 5 months ago
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆344Updated last year
- Yet another PyTorch Trainer and some core components for deep learning.☆222Updated last year
- When it comes to optimizers, it's always better to be safe than sorry☆402Updated 4 months ago
- Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in C…☆404Updated last month
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆169Updated this week
- ☆218Updated 2 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆234Updated 4 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.☆672Updated 2 months ago
- An awesome GPU task scheduler. A lightweight, easy-to-use task scheduling tool for GPU clusters. If you find it useful, consider giving it a star☆197Updated 3 years ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆101Updated 3 months ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆390Updated last year
- [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)☆445Updated last week