ShishirPatil / poetLinks
ML model training for edge devices
β167Updated 2 years ago
Alternatives and similar repositories for poet
Users that are interested in poet are comparing it to the libraries listed below
Sorting:
- π Interactive performance profiling and debugging tool for PyTorch neural networks.β64Updated 10 months ago
- β120Updated last year
- β157Updated 2 years ago
- β159Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMsβ110Updated last year
- Home for OctoML PyTorch Profilerβ114Updated 2 years ago
- β113Updated last year
- GPTQ inference Triton kernelβ316Updated 2 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mindβ¦β161Updated 2 months ago
- A Python library transfers PyTorch tensors between CPU and NVMeβ122Updated last year
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".β278Updated 2 years ago
- β71Updated 8 months ago
- This repository contains the experimental PyTorch native float8 training UXβ226Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"β390Updated last year
- Reorder-based post-training quantization for large language modelβ197Updated 2 years ago
- A schedule language for large model trainingβ151Updated 3 months ago
- Memory Optimizations for Deep Learning (ICML 2023)β111Updated last year
- PB-LLM: Partially Binarized Large Language Modelsβ157Updated 2 years ago
- [MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Servingβ331Updated last year
- β252Updated last year
- [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestrationβ243Updated last year
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"β147Updated 2 years ago
- Torch Distributed Experimentalβ117Updated last year
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021β56Updated 4 years ago
- AI and Memory Wallβ224Updated last year
- Repository for CPU Kernel Generation for LLM Inferenceβ27Updated 2 years ago
- β94Updated 3 years ago
- β122Updated last year
- π Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.β216Updated 2 weeks ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundryβ42Updated last year