nebuly-ai / exploring-AI-optimization
Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient π
β114Updated last year
Alternatives and similar repositories for exploring-AI-optimization
Users that are interested in exploring-AI-optimization are comparing it to the libraries listed below
Sorting:
- An open-source AutoML Library based on PyTorchβ306Updated last month
- experiments with inference on llamaβ104Updated 11 months ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleβ83Updated last year
- ML/DL Math and Method notesβ60Updated last year
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β136Updated 9 months ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyβ129Updated 2 years ago
- A scalable & efficient active learning/data selection system for everyone.β214Updated 10 months ago
- Codes for paper "KNAS: Green Neural Architecture Search"β92Updated 3 years ago
- ML model training for edge devicesβ163Updated last year
- β250Updated 9 months ago
- β29Updated 2 years ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernelβ181Updated last year
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog poβ¦β92Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β110Updated last week
- The Triton backend for the PyTorch TorchScript models.β150Updated last week
- Slides, notes, and materials for the workshopβ325Updated 11 months ago
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the β¦β57Updated last year
- ReLM is a Regular Expression engine for Language Modelsβ104Updated last year
- An open-source efficient deep learning framework/compiler, written in python.β698Updated 2 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Dayβ255Updated last year
- deep learning with pytorch lightningβ1Updated 6 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.β132Updated last year
- Make triton easierβ47Updated 11 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mindβ¦β157Updated 5 months ago
- β43Updated 2 years ago
- train with kittens!β57Updated 6 months ago
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.β111Updated last year
- Home for OctoML PyTorch Profilerβ113Updated 2 years ago
- Cataloging released Triton kernels.β221Updated 4 months ago