nebuly-ai / exploring-AI-optimization

Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀

☆119

Alternatives and similar repositories for exploring-AI-optimization

Users who are interested in exploring-AI-optimization are comparing it to the repositories listed below.

stas00 / ml-ways
ML/DL Math and Method notes
☆64 Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆103 Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256 Updated last year
Jingjing-NLP / KNAS
Codes for paper "KNAS: Green Neural Architecture Search"
☆93 Updated 3 years ago
kshitij12345 / torchnnprofiler
Context Manager to profile the forward and backward times of PyTorch's nn.Module
☆82 Updated 2 years ago
rasbt / pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…
☆92 Updated 2 years ago
meta-pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆161 Updated last month
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆43 Updated last year
HuaizhengZhang / Active-Learning-as-a-Service
A scalable & efficient active learning/data selection system for everyone.
☆217 Updated last year
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆114 Updated 2 years ago
lessw2020 / transformer_central
Various transformers for FSDP research
☆39 Updated 2 years ago
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆96 Updated 2 years ago
sayakpaul / keras-xla-benchmarks
Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.
☆37 Updated 2 years ago
ShishirPatil / poet
ML model training for edge devices
☆168 Updated 2 years ago
warner-benjamin / fastxtend
Train fastai models faster (and other useful tools)
☆71 Updated 4 months ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆48 Updated last year
EmbeddedLLM / vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆90 Updated last week
IlyasMoutawwakil / llm-perf-backend
The backend behind the LLM-Perf Leaderboard
☆11 Updated last year
aporia-ai / inferencedb
🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)
☆81 Updated 3 years ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112 Updated 4 months ago
huggingface / kernel-builder
👷 Build compute kernels
☆158 Updated this week
ezyang / torchdbg
PyTorch centric eager mode debugger
☆48 Updated 10 months ago
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆120 Updated 8 months ago
pytorch / torchdistx
Torch Distributed Experimental
☆117 Updated last year
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30 Updated 2 years ago
huggingface / optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…
☆318 Updated 3 weeks ago
google-deepmind / asyncdiloco
☆46 Updated last year
FasterAI-Labs / fasterai
FasterAI: Prune and Distill your models with FastAI and PyTorch
☆249 Updated 4 months ago
mlflow / mlflow-torchserve
Plugin for deploying MLflow models to TorchServe
☆110 Updated 2 years ago
CentML / DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆64 Updated 8 months ago