autonomi-ai / nosView external linksLinks
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
☆147Jun 8, 2024Updated last year
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below
Sorting:
- A Dockerfile builder for Machine Learning developers☆20May 3, 2024Updated last year
- Simple orchestration for EC2 spot containers☆19Sep 27, 2024Updated last year
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- Deep Learning Inference in 35 Lines of Python☆22Mar 27, 2015Updated 10 years ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated last year
- Real-time semantic segmentation inference production ready code based on deeplab-resnet/psp-net and tensorflow☆11May 18, 2018Updated 7 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language api (LLM). It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- Building/Packaging SLAM Libraries with conda☆13Apr 12, 2018Updated 7 years ago
- Example of a Streamlit data app powered by Vaex☆11Jul 7, 2022Updated 3 years ago
- ☆14Aug 25, 2024Updated last year
- ☆17May 22, 2025Updated 8 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- Hassle-free ML Pipelines on Kubernetes☆39May 28, 2023Updated 2 years ago
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆68Feb 6, 2026Updated last week
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆39Oct 17, 2025Updated 3 months ago
- ☆17Dec 18, 2023Updated 2 years ago
- Automated LLM novelist☆46Apr 11, 2024Updated last year
- Building synthetic data for preference tuning☆27Dec 26, 2024Updated last year
- A guide to testing different runpod (and other linux VMs) configurations. Specifically the speed of LLM outputs☆17Jan 12, 2024Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,718May 21, 2025Updated 8 months ago
- This is a self hosting repository for creating AI Agents and AI Agent powered workflows using n8n, qdrant, ollama, postgres and redis☆28Dec 31, 2025Updated last month
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆26Updated this week
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated last month
- ☆18Feb 5, 2026Updated last week
- Official implementation of Half-Quadratic Quantization (HQQ)☆913Dec 18, 2025Updated last month
- Official implementation of the ANLS* metric☆22Updated this week
- Official n8n custom node for VLM Run☆29Jan 22, 2026Updated 3 weeks ago
- A frontend for creative writing with LLMs☆147Jul 15, 2024Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Sep 26, 2024Updated last year
- The simplest way to serve AI/ML models in production☆1,113Updated this week
- Controllable Animation Video Generation with Large Models-based Multimodal Agents☆232Jan 7, 2026Updated last month
- Repository containing awesome resources regarding Hugging Face tooling.☆48Jan 8, 2024Updated 2 years ago
- ☆60Jan 21, 2024Updated 2 years ago
- create workflows with LLMs☆55Aug 2, 2024Updated last year
- Privacy-Preserving Bandits (MLSys'20)☆22Dec 8, 2022Updated 3 years ago