titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessible to everyone.
β114Updated last year
Alternatives and similar repositories for takeoff-community:
Users that are interested in takeoff-community are comparing it to the libraries listed below
- Mistral + Haystack: build RAG pipelines that rock π€β103Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ102Updated 2 weeks ago
- β85Updated last year
- β199Updated last year
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated last year
- Web App for generating synthetic dataβ46Updated 7 months ago
- π Datasets and models for instruction-tuningβ238Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β109Updated this week
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated last week
- β161Updated last year
- Resources for exploring Generative Feedback Loops with Weaviate!β37Updated 3 months ago
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β136Updated 8 months ago
- Large Language Model (LLM) Inference API and Chatbotβ125Updated last year
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector databaseβ54Updated 8 months ago
- Hassle-free ML Pipelines on Kubernetesβ38Updated last year
- β77Updated 10 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAGβ319Updated 5 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipelineβ37Updated last year
- β16Updated 10 months ago
- β31Updated 4 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystackβ144Updated this week
- β47Updated last year
- experiments with inference on llamaβ104Updated 10 months ago
- β88Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented β¦β84Updated last year
- Data extraction with LLM on CPUβ68Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ105Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- β40Updated last year