prabhuomkar / bitbeastLinks
Experiments with Model Training, Deployment & Monitoring
โ40Updated 5 months ago
Alternatives and similar repositories for bitbeast
Users that are interested in bitbeast are comparing it to the libraries listed below
Sorting:
- Machine Learning Serving focused on GenAI with simplicity as the top priority.โ59Updated 3 weeks ago
- Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient ๐โ119Updated 2 years ago
- A โก๏ธ Lightning.ai โก๏ธ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGoโ27Updated 3 years ago
- Various transformers for FSDP researchโ38Updated 3 years ago
- GGML implementation of BERT model with Python bindings and quantization.โ58Updated last year
- The backend behind the LLM-Perf Leaderboardโ11Updated last year
- The Triton backend for the PyTorch TorchScript models.โ172Updated 2 weeks ago
- ๐คArtificial intelligence classify a food ๐ nutritional table by a simple photo. Don't eat ๐๐๐ฎ...โ10Updated 5 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesโ196Updated last year
- The Triton backend for the ONNX Runtime.โ172Updated 2 weeks ago
- Python bindings for ggmlโ147Updated last year
- Lightning HPO & Training Studio Appโ19Updated 2 years ago
- ๐น๏ธ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.โ138Updated last year
- Google TPU optimizations for transformers modelsโ135Updated last week
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyโ129Updated 2 years ago
- Article about deploying machine learning models using grpc, pytorch and asyncioโ30Updated 3 years ago
- Port of Microsoft's BioGPT in C/C++ using ggmlโ85Updated last year
- experiments with inference on llamaโ103Updated last year
- RWKV (Receptance Weighted Key Value) is a RNN with Transformer-level performanceโ41Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggmlโ306Updated last year
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMsโ110Updated 2 years ago
- Binding to transformers in ggmlโ64Updated 2 weeks ago
- ML/DL Math and Method notesโ66Updated 2 years ago
- The Triton backend for TensorRT.โ84Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.โ32Updated 4 months ago
- ๐ค Trade any tensors over the networkโ30Updated 2 years ago
- A miniture AI training framework for PyTorchโ42Updated last year
- Visualize ONNX models with model-explorerโ67Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMsโ267Updated last month
- Booster - open accelerator for LLM models. Better inference and debugging for AI hackersโ167Updated last year