nebuly-ai / exploring-AI-optimizationLinks
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient ๐
โ116Updated last year
Alternatives and similar repositories for exploring-AI-optimization
Users that are interested in exploring-AI-optimization are comparing it to the libraries listed below
Sorting:
- An open-source AutoML Library based on PyTorchโ306Updated last month
- โ29Updated 2 years ago
- deep learning with pytorch lightningโ1Updated 7 months ago
- โ250Updated 10 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mindโฆโ157Updated 6 months ago
- PDFs and Codelabs for the Efficient Deep Learning book.โ192Updated 2 years ago
- experiments with inference on llamaโ104Updated last year
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyโ128Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleโ83Updated last year
- Lightning HPO & Training Studio Appโ18Updated 2 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsโ86Updated this week
- A scalable & efficient active learning/data selection system for everyone.โ214Updated 10 months ago
- Fine-tune an LLM to perform batch inference and online serving.โ111Updated last week
- ML model training for edge devicesโ164Updated last year
- ๐น๏ธ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.โ137Updated 10 months ago
- ML/DL Math and Method notesโ61Updated last year
- Plugin for deploying MLflow models to TorchServeโ109Updated 2 years ago
- ๐ Interactive performance profiling and debugging tool for PyTorch neural networks.โ61Updated 4 months ago
- Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elasโฆโ662Updated last year
- Functional local implementations of main model parallelism approachesโ95Updated 2 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Dayโ256Updated last year
- โ35Updated 5 months ago
- Collection of kernels written in Triton languageโ125Updated 2 months ago
- MLCubeยฎ is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.โ157Updated 8 months ago
- git extension for {collaborative, communal, continual} model developmentโ212Updated 6 months ago
- The Triton backend for the PyTorch TorchScript models.โ150Updated 3 weeks ago
- FasterAI: Prune and Distill your models with FastAI and PyTorchโ248Updated 2 months ago
- Home for OctoML PyTorch Profilerโ113Updated 2 years ago
- โ43Updated 2 years ago
- Torch Distributed Experimentalโ117Updated 10 months ago