prabhuomkar / bitbeast

Experiments with Model Training, Deployment & Monitoring

☆39

Alternatives and similar repositories for bitbeast:

Users who are interested in bitbeast are comparing it to the libraries listed below.

lessw2020 / transformer_central
Various transformers for FSDP research
☆37Updated 2 years ago
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆58Updated 2 weeks ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆34Updated 4 months ago
FrancescoSaverioZuppichini / dynamic-batching-asyncio
☆32Updated 2 years ago
IlyasMoutawwakil / llm-perf-backend
The backend behind the LLM-Perf Leaderboard
☆10Updated 11 months ago
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated last year
Nachimak28 / LAI-voice-search-openai-whisper-demo
A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo
☆26Updated 2 years ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated 10 months ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆46Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆136Updated 9 months ago
UmerHA / triton_util
Make triton easier
☆47Updated 10 months ago
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆142Updated this week
gau-nernst / learn-cuda
Learn CUDA with PyTorch
☆20Updated 2 months ago
sdpython / onnxcustom
Tutorial on how to convert machine learned models into ONNX
☆16Updated 2 years ago
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆266Updated last year
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆108Updated 3 months ago
aporia-ai / inferencedb
🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)
☆80Updated 2 years ago
onnx / neural-compressor
Model compression for ONNX
☆91Updated 5 months ago
PrithivirajDamodaran / SPLADERunner
Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆30Updated 8 months ago
Preemo-Inc / text-generation-inference
☆199Updated last year
mlc-ai / llm-perf-bench
☆118Updated last year
stas00 / ml-ways
ML/DL Math and Method notes
☆60Updated last year
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆37Updated last year
Lightning-AI / utilities
Common Python utilities and GitHub Actions in Lightning Ecosystem
☆56Updated this week
lucidrains / pytorch-custom-utils
Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new…
☆120Updated 9 months ago
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆37Updated last year
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆56Updated last year
AnswerDotAI / minai
A miniature AI training framework for PyTorch
☆40Updated 2 months ago
huggingface / kernel-builder
👷 Build compute kernels
☆35Updated this week
huggingface / kernels
Load compute kernels from the Hub
☆115Updated this week