autonomi-ai / nosLinks
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
☆145Updated last year
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below
Sorting:
- ☆197Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆52Updated last month
- Vector Database with support for late interaction and token level embeddings.☆55Updated 3 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- GRDN.AI app for garden optimization☆70Updated last year
- run paligemma in real time☆133Updated last year
- Python client library for improving your LLM app accuracy☆98Updated 7 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆132Updated 6 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- Replace expensive LLM calls with finetunes automatically☆63Updated last year
- ☆111Updated last year
- Efficient vector database for hundred millions of embeddings.☆208Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆52Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆159Updated last month
- LLaVA server (llama.cpp).☆182Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Full finetuning of large language models without large memory requirements☆93Updated 2 weeks ago
- ☆67Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated last year
- Pipeline is an open source python SDK for building AI/ML workflows☆138Updated last year
- Self-host LLMs with vLLM and BentoML☆150Updated last week
- ☆123Updated last year
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration"☆55Updated last year
- Fine-tuning and serving LLMs on any cloud☆90Updated last year
- Let's create synthetic textbooks together :)☆75Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year
- Synthetic Data for LLM Fine-Tuning☆120Updated last year