autonomi-ai / nosLinks
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW. 
☆145Updated last year
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below
Sorting:
- Vector Database with support for late interaction and token level embeddings.☆55Updated 4 months ago
- ☆197Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆140Updated last year
- run paligemma in real time☆133Updated last year
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- Efficient vector database for hundred millions of embeddings.☆208Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆53Updated 2 months ago
- Chat Markup Language conversation library☆55Updated last year
- GRDN.AI app for garden optimization☆70Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆162Updated 2 months ago
- ☆38Updated last year
- Python client library for improving your LLM app accuracy☆97Updated 8 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last month
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 7 months ago
- Start a server from the MLX library.☆192Updated last year
- ☆89Updated last year
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- ScalarLM - a unified training and inference stack☆90Updated 3 weeks ago
- Routing on Random Forest (RoRF)☆215Updated last year
- ☆112Updated last year
- Cerule - A Tiny Mighty Vision Model☆67Updated last year
- Fine-tuning and serving LLMs on any cloud☆89Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated last year
- ☆67Updated last year
- ☆123Updated last year
- Synthetic Data for LLM Fine-Tuning☆120Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 3 weeks ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆151Updated 3 months ago