autonomi-ai / nosLinks
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
☆146Updated last year
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below
Sorting:
- ☆198Updated last year
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Python client library for improving your LLM app accuracy☆97Updated 11 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 5 months ago
- GPU prices aggregator for cloud providers☆45Updated 3 weeks ago
- run paligemma in real time☆133Updated last year
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- Replace expensive LLM calls with finetunes automatically☆66Updated last year
- Pipeline is an open source python SDK for building AI/ML workflows☆138Updated last year
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 10 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆168Updated 3 weeks ago
- Chat Markup Language conversation library☆55Updated 2 years ago
- Start a server from the MLX library.☆195Updated last year
- ScalarLM - a unified training and inference stack☆95Updated 2 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 weeks ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆170Updated 2 years ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- ☆38Updated last year
- GRDN.AI app for garden optimization☆69Updated 2 months ago
- Efficient vector database for hundred millions of embeddings.☆211Updated last year
- 🐮📢 The first AI voice assistant that interrupts *you*☆148Updated last year
- Fine-tuning and serving LLMs on any cloud☆90Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Updated 4 months ago
- ☆67Updated 9 months ago
- ☆206Updated last year
- ☆40Updated last year
- Synthetic Data for LLM Fine-Tuning☆120Updated 2 years ago