Runs 405B LLMs on 8GB VRAM
☆2,763Apr 2, 2026Updated last month
Alternatives and similar repositories for airllm
Users that are interested in airllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 100% in-browser, hands-free AI voice chat using Whisper, WebLLM, and Supertonic TTS☆159Dec 11, 2025Updated 5 months ago
- AirLLM 70B inference with single 4GB GPU☆17,925Mar 10, 2026Updated 2 months ago
- [MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on …☆11,502Updated this week
- Open-source context retrieval layer for AI agents☆6,333Updated this week
- 💻InfiniteGPU is a platform that enables effortless exchange of compute resources for AI workloads☆83Apr 23, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictab…☆18,069May 12, 2026Updated last week
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG☆31,186May 13, 2026Updated last week
- Official inference framework for 1-bit LLMs☆38,971Mar 10, 2026Updated 2 months ago
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,947Updated this week
- 🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!☆49,632May 11, 2026Updated last week
- Open Source AI Platform - AI Chat with advanced features that works with every LLM☆284May 12, 2026Updated last week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆28,487Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval…☆15,468May 6, 2026Updated 2 weeks ago
- Multi Face Recognition and Detection☆68Nov 1, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!☆82May 4, 2025Updated last year
- Camera monitoring with VLM☆1,420Mar 9, 2026Updated 2 months ago
- 100% free and full open-source edge Firecrawl alternative with better links extraction for agents - that you can deploy to cloudflare or …☆574Mar 12, 2026Updated 2 months ago
- OpenAI chatGPT hybrid search and retrieval augmented generation☆16May 2, 2026Updated 2 weeks ago
- 🪄 Create rich visualizations with AI☆15,693Updated this week
- Herramienta de extracción de exploits desde Shodan Exploits para facilitar la búsqueda de vulnerabilidades conocidas. 👁☆41Sep 20, 2025Updated 8 months ago
- Tensorlake is a serverless runtime for sandboxes and deploying background agentic applications☆919Updated this week
- A lightweight self-hosted bot in a single binary, written in Go.☆1,235Mar 28, 2026Updated last month
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"☆15,588Apr 30, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆8,755Mar 24, 2026Updated last month
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆36,478Updated this week
- Turn any YouTube video into a documentation link☆324Feb 3, 2026Updated 3 months ago
- SoTA open-source TTS☆24,761May 1, 2026Updated 2 weeks ago
- A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.☆2,365Apr 20, 2026Updated last month
- AI Coding assistant using Codex that integrates directly with Telegram. Send off coding requests from your phone!☆56Oct 21, 2025Updated 6 months ago
- Low-code/no-code python library that transforms plain English instructions into fully configured multi-agent AI teams☆192Apr 10, 2026Updated last month
- 🔥 Search, scrape, and clean the web for AI agents.☆120,407Updated this week
- 100+ AI Agent & RAG apps you can actually run — clone, customize, ship.☆110,222May 9, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Composable Google Sheets CLI for humans and agents. Read, write, update cells by key—with Agent Skills for Claude Code and OpenAI Codex.☆60Feb 11, 2026Updated 3 months ago
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost …☆26,322Apr 22, 2026Updated 3 weeks ago
- Swiss-army tool for scraping and extracting data from online assets, made for hackers☆4,663Oct 12, 2024Updated last year
- A complete workflow automation and server monitoring system.☆4,153Updated this week
- MPC Server for PySpark inpired by the LakeSail☆18Feb 26, 2026Updated 2 months ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆8,282Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,367Feb 21, 2025Updated last year