prabhuomkar / bitbeast
Experiments with Model Training, Deployment & Monitoring
☆39 · Updated 3 months ago
Alternatives and similar repositories for bitbeast
Users that are interested in bitbeast are comparing it to the libraries listed below
- Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient ☆119 · Updated 2 years ago
- Python bindings for ggml ☆146 · Updated last year
- The Triton backend for the ONNX Runtime. ☆163 · Updated this week
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ☆139 · Updated last year
- The backend behind the LLM-Perf Leaderboard ☆11 · Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆92 · Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆266 · Updated last year
- Lightning HPO & Training Studio App ☆18 · Updated 2 years ago
- Google TPU optimizations for transformers models ☆121 · Updated 9 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆58 · Updated last month
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs ☆110 · Updated last year
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O… ☆317 · Updated last month
- The Triton backend for the PyTorch TorchScript models. ☆163 · Updated last week
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po… ☆91 · Updated 2 years ago
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen… ☆71 · Updated this week
- Home for OctoML PyTorch Profiler ☆114 · Updated 2 years ago
- Various transformers for FSDP research ☆38 · Updated 2 years ago
- ML/DL Math and Method notes ☆64 · Updated last year
- experiments with inference on llama ☆103 · Updated last year
- Article about deploying machine learning models using grpc, pytorch and asyncio ☆29 · Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 · Updated last month
- ☆32 · Updated 2 years ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i… ☆181 · Updated 2 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆296 · Updated last year
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo ☆27 · Updated 3 years ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy ☆129 · Updated 2 years ago
- Benchmark suite for LLMs from Fireworks.ai ☆83 · Updated last week
- ClearML - Model-Serving Orchestration and Repository Solution ☆156 · Updated last month
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras. ☆37 · Updated 2 years ago
- SGLang is a fast serving framework for large language models and vision language models. ☆30 · Updated this week