titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessible to everyone.
☆114Updated last year
Alternatives and similar repositories for takeoff-community:
Users that are interested in takeoff-community are comparing it to the libraries listed below
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆93Updated last month
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆29Updated 5 months ago
- ☆199Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆100Updated last year
- 📚 Datasets and models for instruction-tuning☆233Updated last year
- ☆47Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Web App for generating synthetic data☆46Updated 5 months ago
- Command Line Interface for Hugging Face Inference Endpoints☆67Updated 10 months ago
- Hassle-free ML Pipelines on Kubernetes☆37Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 7 months ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆78Updated last month
- Large Language Model (LLM) Inference API and Chatbot☆124Updated 10 months ago
- ☆12Updated 9 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated last month
- End-to-End LLM Guide☆101Updated 7 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆79Updated last year
- Data extraction with LLM on CPU☆68Updated last year
- ☆15Updated 8 months ago
- Iterate fast on your RAG pipelines☆22Updated 2 months ago
- ☆76Updated 8 months ago
- Data extraction with LLM on CPU☆85Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆100Updated 2 months ago
- ☆52Updated last month
- A project that enables identification and classification of an intent of a message with dynamic labels☆33Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- Set of scripts to finetune LLMs☆36Updated 10 months ago