titanml / takeoff-communityLinks
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessible to everyone.
β114Updated last year
Alternatives and similar repositories for takeoff-community
Users that are interested in takeoff-community are comparing it to the libraries listed below
Sorting:
- π€ Trade any tensors over the networkβ30Updated 2 years ago
- π Datasets and models for instruction-tuningβ237Updated 2 years ago
- β197Updated last year
- Large Language Model (LLM) Inference API and Chatbotβ126Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated last month
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β139Updated last year
- Command Line Interface for Hugging Face Inference Endpointsβ66Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β83Updated 10 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β33Updated last month
- experiments with inference on llamaβ103Updated last year
- Large Language Model Hosting Containerβ90Updated last month
- A Lightweight Library for AI Observabilityβ251Updated 8 months ago
- Mistral + Haystack: build RAG pipelines that rock π€β106Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β50Updated last year
- Chunk your text using gpt4o-mini more accuratelyβ44Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β68Updated last year
- β73Updated last year
- Framework for building and maintaining self-updating prompts for LLMsβ64Updated last year
- π Unstructured Data Connectors for Haystack 2.0β17Updated 2 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ117Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β38Updated last year
- β48Updated 2 years ago
- β80Updated last year
- β77Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ111Updated 7 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β83Updated last year
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.β13Updated 2 years ago
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated 2 years ago
- β89Updated 2 years ago
- Vector Database with support for late interaction and token level embeddings.β55Updated 4 months ago