dstackai / gpuhuntLinks
GPU prices aggregator for cloud providers
☆45Updated last month
Alternatives and similar repositories for gpuhunt
Users that are interested in gpuhunt are comparing it to the libraries listed below
Sorting:
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.☆147Updated last year
- Tutorial for building LLM router☆244Updated last year
- ☆198Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 9 months ago
- Embedding models from Jina AI☆65Updated 2 years ago
- ☆114Updated last year
- Python client library for improving your LLM app accuracy☆97Updated 11 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 5 months ago
- Self-host LLMs with vLLM and BentoML☆168Updated 2 weeks ago
- Routing on Random Forest (RoRF)☆239Updated last year
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆133Updated 11 months ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆34Updated 9 months ago
- A simple DAG for executing LLM calls and using tools.☆42Updated 2 years ago
- Repo to experiment with Graph RAG strategies using Kùzu☆64Updated 4 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 10 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Updated last year
- ScalarLM - a unified training and inference stack☆96Updated 2 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆154Updated last year
- Gitlab.com Mirror - Please open issues and pull requests over there☆54Updated 3 months ago
- Transformer GPU VRAM estimator☆68Updated last year
- Replace expensive LLM calls with finetunes automatically☆66Updated last year
- simplifies the process of creating and managing LLM workflows.☆113Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 10 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆82Updated 11 months ago
- Embed anything.☆27Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆121Updated last week
- Tools for formatting large language model prompts.☆13Updated 2 years ago