⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
☆147 · Jun 8, 2024 · Updated last year
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below.
- A Dockerfile builder for Machine Learning developers ☆20 · May 3, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Simple orchestration for EC2 spot containers ☆19 · Sep 27, 2024 · Updated last year
- ☆12 · Apr 1, 2024 · Updated 2 years ago
- AI web parser library + CLI ☆48 · May 5, 2025 · Updated 11 months ago
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language model (LLM). It leverages the power of Zod for s… ☆12 · Nov 7, 2024 · Updated last year
- Simple LLM inference server ☆20 · Jun 13, 2024 · Updated last year
- ☆31 · Updated this week
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆26 · Mar 6, 2025 · Updated last year
- ☆60 · Jan 21, 2024 · Updated 2 years ago
- AI_Powered_Dev_Search_Engine ☆12 · Mar 10, 2024 · Updated 2 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" ☆18 · Oct 1, 2024 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆97 · Feb 9, 2023 · Updated 3 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation ☆40 · Oct 17, 2025 · Updated 5 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆48 · Sep 26, 2024 · Updated last year
- ☆17 · May 22, 2025 · Updated 10 months ago
- ☆17 · Dec 18, 2023 · Updated 2 years ago
- A web-app to explore topics using LLM (less typing and more clicks) ☆68 · Mar 15, 2026 · Updated 3 weeks ago
- ☆67 · May 23, 2025 · Updated 10 months ago
- A frontend for creative writing with LLMs ☆157 · Jul 15, 2024 · Updated last year
- Universal connector to LLMs for Node.js & Bun ☆30 · Updated this week
- A really tiny autograd engine ☆100 · May 26, 2025 · Updated 10 months ago
- ☆24 · Dec 27, 2024 · Updated last year
- Automated LLM novelist ☆47 · Apr 11, 2024 · Updated last year
- ☆14 · Dec 21, 2025 · Updated 3 months ago
- Official implementation of Half-Quadratic Quantization (HQQ) ☆925 · Feb 26, 2026 · Updated last month
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab… ☆29 · Nov 21, 2025 · Updated 4 months ago
- Building synthetic data for preference tuning ☆27 · Dec 26, 2024 · Updated last year
- Controllable Animation Video Generation with Large Models-based Multimodal Agents ☆240 · Jan 7, 2026 · Updated 3 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,743 · May 21, 2025 · Updated 10 months ago
- Interface for interacting with Gradient AI in Python ☆15 · Jun 28, 2024 · Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included ☆17 · Oct 2, 2024 · Updated last year
- Generate Stunning Images and Craft Visual Stories for your Brand ☆20 · Oct 25, 2024 · Updated last year
- Easily create LLM automation/agent workflows ☆60 · Feb 13, 2024 · Updated 2 years ago
- Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra. ☆1,176 · Mar 23, 2026 · Updated 2 weeks ago
- Agentic Keyframe Search for Video Question Answering ☆17 · Apr 7, 2025 · Updated last year
- ☆20 · Sep 28, 2024 · Updated last year
- ☆66 · Jun 27, 2024 · Updated last year