autonomi-ai / nos
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
☆125Updated 3 months ago
Related projects: ⓘ
- run paligemma in real time☆122Updated 4 months ago
- ☆201Updated 7 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆128Updated this week
- Vector Database with support for late interaction and token level embeddings.☆51Updated last week
- Python client library for improving your LLM app accuracy☆94Updated this week
- A toolkit for building AI agents that use devices☆93Updated this week
- Tutorial for building LLM router☆145Updated 2 months ago
- ☆77Updated this week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 4 months ago
- A simple Python sandbox for helpful LLM data agents☆143Updated 3 months ago
- AI For Software Operations☆81Updated this week
- Action library for AI Agent☆187Updated last week
- Start a server from the MLX library.☆157Updated last month
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆63Updated last week
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- A library for building software agents using behavior trees and language models.☆69Updated 4 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆154Updated 8 months ago
- Replace expensive LLM calls with finetunes automatically☆60Updated 7 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.☆150Updated this week
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆72Updated last week
- ☆58Updated 3 weeks ago
- Chat Markup Language conversation library☆53Updated 8 months ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Full finetuning of large language models without large memory requirements☆94Updated 8 months ago
- ☆64Updated 3 months ago
- Examples of models deployable with Truss☆120Updated this week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆192Updated 4 months ago
- WIP - Allows you to create DSPy pipelines using ComfyUI☆170Updated last month
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆53Updated 2 months ago
- ☆40Updated 7 months ago