prabhuomkar / bitbeast
Experiments with Model Training, Deployment & Monitoring
☆39Updated last month
Alternatives and similar repositories for bitbeast:
Users that are interested in bitbeast are comparing it to the libraries listed below
- Various transformers for FSDP research☆37Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 2 weeks ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- ☆32Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard☆10Updated 11 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo☆26Updated 2 years ago
- experiments with inference on llama☆104Updated 10 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆136Updated 9 months ago
- Make triton easier☆47Updated 10 months ago
- The Triton backend for the ONNX Runtime.☆142Updated this week
- Learn CUDA with PyTorch☆20Updated 2 months ago
- Tutorial on how to convert machine learned models into ONNX☆16Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆266Updated last year
- Google TPU optimizations for transformers models☆108Updated 3 months ago
- 🚀 Stream inferences of real-time ML models in production to any data lake (Experimental)☆80Updated 2 years ago
- Model compression for ONNX☆91Updated 5 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆30Updated 8 months ago
- ☆199Updated last year
- ☆118Updated last year
- ML/DL Math and Method notes☆60Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- Common Python utilities and GitHub Actions in Lightning Ecosystem☆56Updated this week
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 9 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- A miniture AI training framework for PyTorch☆40Updated 2 months ago
- 👷 Build compute kernels☆35Updated this week
- Load compute kernels from the Hub☆115Updated this week