titanml / takeoff-communityLinks

TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessible to everyone.

☆114

Alternatives and similar repositories for takeoff-community

Users that are interested in takeoff-community are comparing it to the libraries listed below

Sorting:

neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆238Updated 2 years ago
anakin87 / mistral-haystack
Mistral + Haystack: build RAG pipelines that rock 🤘
☆106Updated last year
Preemo-Inc / text-generation-inference
☆198Updated last year
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Updated last month
aniketmaurya / llm-inference
Large Language Model (LLM) Inference API and Chatbot
☆126Updated last year
rragundez / chunkdot
Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …
☆83Updated 11 months ago
PrithivirajDamodaran / Route0x
Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da
☆117Updated 8 months ago
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆139Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆68Updated 2 weeks ago
grski / bRAG
☆48Updated 2 years ago
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated 2 years ago
MantisAI / hugie
Command Line Interface for Hugging Face Inference Endpoints
☆66Updated last year
awslabs / llm-hosting-container
Large Language Model Hosting Container
☆90Updated last month
ParadigmAI / paradigm
Hassle-free ML Pipelines on Kubernetes
☆39Updated 2 years ago
anyscale / ray-summit-2023-training
☆89Updated 2 years ago
davanstrien / data-for-fine-tuning-llms
☆80Updated last year
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆251Updated 9 months ago
PrithivirajDamodaran / SPLADERunner
Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆33Updated last year
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆196Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆103Updated last year
dm4ml / motion
Framework for building and maintaining self-updating prompts for LLMs
☆64Updated last year
cohere-ai / DiskVectorIndex
☆210Updated 5 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆78Updated last year
mrmps / ai-chunker
Chunk your text using gpt4o-mini more accurately
☆44Updated last year
BerriAI / bettertest
☆74Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆51Updated last year
andrewnguonly / ChatAbstractions
LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!
☆83Updated last year
amogkam / llama_index_ray
Using LlamaIndex with Ray for productionizing LLM applications
☆71Updated 2 years ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆114Updated 6 months ago
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆81Updated 9 months ago