ShishirPatil / poet
ML model training for edge devices
☆168 Updated 2 years ago
Alternatives and similar repositories for poet
Users that are interested in poet are comparing it to the libraries listed below
- Interactive performance profiling and debugging tool for PyTorch neural networks. ☆64 Updated 11 months ago
- ☆157 Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs ☆110 Updated last year
- ☆160 Updated 2 years ago
- ☆115 Updated last year
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021 ☆55 Updated 4 years ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient" ☆148 Updated 2 years ago
- Home for OctoML PyTorch Profiler ☆114 Updated 2 years ago
- ☆120 Updated last year
- Memory Optimizations for Deep Learning (ICML 2023) ☆114 Updated last year
- Reorder-based post-training quantization for large language model ☆196 Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models ☆157 Updated 2 years ago
- A Python library transfers PyTorch tensors between CPU and NVMe ☆123 Updated last year
- A schedule language for large model training ☆152 Updated 4 months ago
- ☆252 Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" ☆392 Updated last year
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models". ☆279 Updated 2 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆162 Updated 2 weeks ago
- ☆94 Updated 3 years ago
- [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration ☆253 Updated last year
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers. ☆51 Updated 2 years ago
- Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆218 Updated 3 weeks ago
- ☆145 Updated 11 months ago
- Compression for Foundation Models ☆35 Updated 5 months ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆141 Updated 2 years ago
- GPTQ inference Triton kernel ☆316 Updated 2 years ago
- AI and Memory Wall ☆225 Updated last year
- A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods. ☆198 Updated 11 months ago
- ☆71 Updated 9 months ago
- Benchmark PyTorch Custom Operators ☆14 Updated 2 years ago