CambioML / gpuv
visualize your gpu usage
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gpuv
- Profiling tools for distributed training☆37Updated last year
- Fine-tuning and serving LLMs on any cloud☆87Updated 11 months ago
- A simple Pure Python/PyTorch performance daemon for training workloads☆15Updated last year
- AI Evaluation Platform☆45Updated 2 weeks ago
- cluster/scheduler health monitoring for GPU jobs on k8s☆44Updated this week
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- ☆101Updated 3 months ago
- pykoi: Active learning in one unified interface☆410Updated 9 months ago
- ☆44Updated last week
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Updated 2 months ago
- A simple DAG for executing LLM calls and using tools.☆39Updated last year
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024☆173Updated 7 months ago
- ☆39Updated 10 months ago
- Your buddy in the (L)LM space.☆63Updated 2 months ago
- Efficient BM25 with DuckDB 🦆☆29Updated last month
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆17Updated 3 months ago
- Embedding models from Jina AI☆56Updated 10 months ago
- experiments with inference on llama☆105Updated 5 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆44Updated 2 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆92Updated 5 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆60Updated last year
- Small, simple agent task environments for training and evaluation☆16Updated 3 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆162Updated 2 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆38Updated 10 months ago
- Google TPU optimizations for transformers models☆75Updated this week
- SGLang is fast serving framework for large language models and vision language models.☆11Updated 2 weeks ago
- ☆43Updated 2 months ago
- Drift detection module for machine learning pipelines.☆21Updated last year
- Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…☆56Updated 7 months ago