Nota-NetsPresso / PyNetsPresso
The official NetsPresso Python package.
☆44Updated this week
Alternatives and similar repositories for PyNetsPresso:
Users that are interested in PyNetsPresso are comparing it to the libraries listed below
- A library for training, compressing and deploying computer vision models (including ViT) with edge devices☆68Updated 2 weeks ago
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆76Updated 6 months ago
- A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]☆56Updated last year
- Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]☆12Updated 7 months ago
- ☆83Updated last year
- ☆56Updated 2 years ago
- OwLite is a low-code AI model compression toolkit for AI models.☆43Updated last month
- A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]☆285Updated 8 months ago
- A performance library for machine learning applications.☆183Updated last year
- Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.☆48Updated 3 years ago
- Example code for RBLN SDK developers building inference applications☆17Updated last week
- ☆51Updated 4 months ago
- 2022_AAAI accepted paper, NaturalInversion:Data-Free Image Synthesis Improving Real-World Consistency☆10Updated 3 years ago
- ☆22Updated 2 months ago
- OwLite Examples repository offers illustrative example codes to help users seamlessly compress PyTorch deep learning models and transform…☆10Updated 6 months ago
- PyTorch CoreSIG☆55Updated 3 months ago
- KoLLaVA: Korean Large Language-and-Vision Assistant (feat.LLaVA)☆289Updated 6 months ago
- Imagenet(for image classification, 2012) 데이터 셋 다운로드 및 정리 방법 정리☆23Updated 4 years ago
- Tiny configuration for Triton Inference Server☆45Updated 2 months ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆29Updated 7 months ago
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models☆97Updated last year
- Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"☆69Updated 8 months ago
- ☆10Updated last year
- [ICLR 2023] RC-MAE☆51Updated last year
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Updated 8 months ago
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆31Updated this week
- ptq4vm official repository☆20Updated last week
- Getting GPU Util 99%☆34Updated 4 years ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆57Updated last year
- ☆12Updated 2 years ago