nebuly-ai / exploring-AI-optimizationLinks
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient π
β118Updated last year
Alternatives and similar repositories for exploring-AI-optimization
Users that are interested in exploring-AI-optimization are comparing it to the libraries listed below
Sorting:
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated last year
- ML/DL Math and Method notesβ61Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleβ83Updated last year
- ML model training for edge devicesβ165Updated last year
- A scalable & efficient active learning/data selection system for everyone.β214Updated last year
- experiments with inference on llamaβ104Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mindβ¦β158Updated 3 weeks ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Dayβ256Updated last year
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyβ128Updated 2 years ago
- Torch Distributed Experimentalβ116Updated 11 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog poβ¦β92Updated 2 years ago
- FasterAI: Prune and Distill your models with FastAI and PyTorchβ249Updated last month
- π Interactive performance profiling and debugging tool for PyTorch neural networks.β64Updated 5 months ago
- Codes for paper "KNAS: Green Neural Architecture Search"β92Updated 3 years ago
- An open-source AutoML Library based on PyTorchβ306Updated last week
- Plugin for deploying MLflow models to TorchServeβ111Updated 2 years ago
- deep learning with pytorch lightningβ1Updated 8 months ago
- Fine-tune an LLM to perform batch inference and online serving.β112Updated last month
- Home for OctoML PyTorch Profilerβ113Updated 2 years ago
- ML model optimization product to accelerate inference.β326Updated last month
- PDFs and Codelabs for the Efficient Deep Learning book.β194Updated 2 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsβ87Updated this week
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumptionβ103Updated last year
- benchmarking some transformer deploymentsβ26Updated 2 years ago
- β125Updated last year
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)β117Updated 3 years ago
- β251Updated 11 months ago
- π€ Trade any tensors over the networkβ30Updated last year
- Various transformers for FSDP researchβ37Updated 2 years ago
- Functional local implementations of main model parallelism approachesβ95Updated 2 years ago