modal-labs / stopwatchLinks
A tool for benchmarking LLMs on Modal
☆39Updated last week
Alternatives and similar repositories for stopwatch
Users that are interested in stopwatch are comparing it to the libraries listed below
Sorting:
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 10 months ago
- ☆77Updated last year
- ☆30Updated 8 months ago
- ☆48Updated 5 months ago
- Multimodal AI workloads: batch inference, model training and online serving.☆19Updated 3 weeks ago
- Tools for merging pretrained large language models.☆19Updated last year
- Cray-LM unified training and inference stack.☆22Updated 5 months ago
- Simple UI for debugging correlations of text embeddings☆287Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated last month
- A sample pattern for running CI tests on Modal☆18Updated 3 months ago
- Lightweight Non-Parametric Embedding Fine-Tuning☆25Updated 9 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated 11 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated last week
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- ☆210Updated 2 weeks ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last week
- ML/DL Math and Method notes☆61Updated last year
- ☆48Updated last year
- Drift detection module for machine learning pipelines.☆25Updated 2 years ago
- High-Performance Engine for Multi-Vector Search☆116Updated last month
- ☆38Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆137Updated 2 months ago
- ☆154Updated 7 months ago
- PyTorch implementation for MRL☆19Updated last year