dstackai / gpuhunt
GPU prices aggregator for cloud providers
☆45Updated last month
Alternatives and similar repositories for gpuhunt
Users who are interested in gpuhunt are comparing it to the libraries listed below.
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.☆147Updated last year
- ☆198Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- ☆114Updated last year
- Python client library for improving your LLM app accuracy☆97Updated 11 months ago
- Repo to experiment with Graph RAG strategies using Kùzu☆64Updated 4 months ago
- Chat Markup Language conversation library☆55Updated 2 years ago
- A collection of tools for your LLMs that run on Modal☆23Updated 11 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆178Updated last week
- A simple DAG for executing LLM calls and using tools.☆42Updated 2 years ago
- GoalChain for goal-orientated LLM conversation flows☆71Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- Replace expensive LLM calls with finetunes automatically☆66Updated last year
- A high-performance batching router that optimises max throughput for text inference workloads☆16Updated 2 years ago
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆154Updated last year
- Tutorial for building LLM router☆242Updated last year
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆83Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data☆20Updated 9 months ago
- Transformer GPU VRAM estimator☆67Updated last year
- ☆40Updated 8 months ago
- A guidance compatibility layer for llama-cpp-python☆36Updated 2 years ago
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard.☆90Updated last year
- Routing on Random Forest (RoRF)☆239Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆173Updated last month
- Embedding models from Jina AI☆65Updated 2 years ago
- Simplifies the process of creating and managing LLM workflows.☆113Updated last year
- Tools for formatting large language model prompts.☆13Updated 2 years ago