titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessible to everyone.
β114Updated 11 months ago
Alternatives and similar repositories for takeoff-community:
Users that are interested in takeoff-community are comparing it to the libraries listed below
- β199Updated 11 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ90Updated 3 weeks ago
- Mistral + Haystack: build RAG pipelines that rock π€β100Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ97Updated last month
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated last week
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated last year
- experiments with inference on llamaβ104Updated 7 months ago
- End-to-End LLM Guideβ99Updated 6 months ago
- Self-host LLMs with vLLM and BentoMLβ79Updated this week
- β154Updated last year
- β77Updated 7 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β78Updated 3 weeks ago
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β138Updated 5 months ago
- Tutorial for building LLM routerβ170Updated 5 months ago
- Fiddler Auditor is a tool to evaluate language models.β174Updated 10 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β79Updated 11 months ago
- Large Language Model (LLM) Inference API and Chatbotβ124Updated 9 months ago
- β83Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.β82Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated last month
- Find the optimal model serving solution for π€ Hugging Face models πβ42Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated 9 months ago
- β12Updated 8 months ago
- β76Updated 7 months ago
- π Datasets and models for instruction-tuningβ232Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwardsβ130Updated this week
- β47Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytesβ¦β146Updated last year
- β40Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)β73Updated 4 months ago