nebuly-ai / exploring-AI-optimizationLinks
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient π
β119Updated 2 years ago
Alternatives and similar repositories for exploring-AI-optimization
Users that are interested in exploring-AI-optimization are comparing it to the libraries listed below
Sorting:
- ML/DL Math and Method notesβ64Updated last year
- experiments with inference on llamaβ103Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Dayβ256Updated last year
- Codes for paper "KNAS: Green Neural Architecture Search"β93Updated 3 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleβ82Updated 2 years ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog poβ¦β92Updated 2 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mindβ¦β161Updated last month
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β43Updated last year
- A scalable & efficient active learning/data selection system for everyone.β217Updated last year
- Home for OctoML PyTorch Profilerβ114Updated 2 years ago
- Various transformers for FSDP researchβ39Updated 2 years ago
- Functional local implementations of main model parallelism approachesβ96Updated 2 years ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated 2 years ago
- ML model training for edge devicesβ168Updated 2 years ago
- Train fastai models faster (and other useful tools)β71Updated 4 months ago
- Repository containing awesome resources regarding Hugging Face tooling.β48Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsβ90Updated last week
- The backend behind the LLM-Perf Leaderboardβ11Updated last year
- π Stream inferences of real-time ML models in production to any data lake (Experimental)β81Updated 3 years ago
- Fine-tune an LLM to perform batch inference and online serving.β112Updated 4 months ago
- π· Build compute kernelsβ158Updated this week
- PyTorch centric eager mode debuggerβ48Updated 10 months ago
- Google TPU optimizations for transformers modelsβ120Updated 8 months ago
- Torch Distributed Experimentalβ117Updated last year
- π€ Trade any tensors over the networkβ30Updated 2 years ago
- ποΈ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Oβ¦β318Updated 3 weeks ago
- β46Updated last year
- FasterAI: Prune and Distill your models with FastAI and PyTorchβ249Updated 4 months ago
- Plugin for deploying MLflow models to TorchServeβ110Updated 2 years ago
- π Interactive performance profiling and debugging tool for PyTorch neural networks.β64Updated 8 months ago