nebuly-ai / exploring-AI-optimization
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient ๐
โ113Updated last year
Alternatives and similar repositories for exploring-AI-optimization:
Users that are interested in exploring-AI-optimization are comparing it to the libraries listed below
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Dayโ255Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleโ84Updated last year
- Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elasโฆโ643Updated 9 months ago
- An open-source AutoML Library based on PyTorchโ306Updated last month
- ML model training for edge devicesโ160Updated last year
- Fast low-bit matmul kernels in Tritonโ236Updated this week
- Amos optimizer with JEstimator lib.โ81Updated 9 months ago
- ๐ Interactive performance profiling and debugging tool for PyTorch neural networks.โ58Updated 3 weeks ago
- Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024โ175Updated 10 months ago
- Cataloging released Triton kernels.โ168Updated last month
- ๐น๏ธ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.โ136Updated 6 months ago
- End-to-End LLM Guideโ101Updated 7 months ago
- MLCubeยฎ is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.โ154Updated 5 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mindโฆโ154Updated 2 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".โ265Updated last year
- ML/DL Math and Method notesโ58Updated last year
- FasterAI: Prune and Distill your models with FastAI and PyTorchโ247Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsโ259Updated 4 months ago
- โ246Updated 6 months ago
- PDFs and Codelabs for the Efficient Deep Learning book.โ192Updated last year
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyโ127Updated last year
- Codes for paper "KNAS: Green Neural Architecture Search"โ91Updated 3 years ago
- A scalable & efficient active learning/data selection system for everyone.โ214Updated 7 months ago
- experiments with inference on llamaโ104Updated 8 months ago
- An open-source efficient deep learning framework/compiler, written in python.โ681Updated last week
- Distributed skorch on Ray Trainโ57Updated 2 years ago
- Home for OctoML PyTorch Profilerโ107Updated last year
- Implementation of a Transformer, but completely in Tritonโ257Updated 2 years ago
- โ29Updated last year
- ๐ค Trade any tensors over the networkโ30Updated last year