Runs 405B LLMs on 8GB VRAM
☆2,755 Apr 2, 2026 Updated 3 weeks ago
Alternatives and similar repositories for airllm
Users who are interested in airllm are comparing it to the libraries listed below.
- 100% in-browser, hands-free AI voice chat using Whisper, WebLLM, and Supertonic TTS ☆154 Dec 11, 2025 Updated 4 months ago
- AirLLM 70B inference with single 4GB GPU ☆16,633 Mar 10, 2026 Updated last month
- [MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on … ☆10,923 Updated this week
- Open-source context retrieval layer for AI agents ☆6,265 Updated this week
- 🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl! ☆38,587 Apr 22, 2026 Updated last week
- 💻 InfiniteGPU is a platform that enables effortless exchange of compute resources for AI workloads ☆83 Apr 23, 2026 Updated last week
- The interaction control harness for customer-facing AI agents - optimized for building controlled, consistent, and predictable customer i… ☆18,034 Updated this week
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG ☆25,672 Apr 21, 2026 Updated last week
- Official inference framework for 1-bit LLMs ☆38,495 Mar 10, 2026 Updated last month
- Open Source AI Platform - AI Chat with advanced features that works with every LLM ☆269 Updated this week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste… ☆4,934 Updated this week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce. ☆27,855 Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval… ☆15,291 Mar 16, 2026 Updated last month
- Multi Face Recognition and Detection ☆68 Nov 1, 2022 Updated 3 years ago
- Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!! ☆82 May 4, 2025 Updated 11 months ago
- Forge Orchestrator: Multi-AI task orchestration. File locking, knowledge capture, drift detection. Rust. ☆108 Apr 19, 2026 Updated last week
- Camera monitoring with VLM ☆1,357 Mar 9, 2026 Updated last month
- 100% free and full open-source edge Firecrawl alternative with better links extraction for agents - that you can deploy to cloudflare or … ☆572 Mar 12, 2026 Updated last month
- OpenAI chatGPT hybrid search and retrieval augmented generation ☆16 Apr 2, 2026 Updated 3 weeks ago
- Tool for extracting exploits from Shodan Exploits to make it easier to search for known vulnerabilities. 👁 ☆40 Sep 20, 2025 Updated 7 months ago
- 🪄 Create rich visualizations with AI ☆15,244 Updated this week
- Tensorlake is a serverless runtime for sandboxes and deploying background agentic applications ☆903 Updated this week
- A lightweight self-hosted bot in a single binary, written in Go. ☆1,177 Mar 28, 2026 Updated last month
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)" ☆15,283 Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆8,513 Mar 24, 2026 Updated last month
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆35,921 Updated this week
- Composable Google Sheets CLI for humans and agents. Read, write, update cells by key—with Agent Skills for Claude Code and OpenAI Codex. ☆58 Feb 11, 2026 Updated 2 months ago
- SoTA open-source TTS ☆24,503 Mar 26, 2026 Updated last month
- AI Coding assistant using Codex that integrates directly with Telegram. Send off coding requests from your phone! ☆55 Oct 21, 2025 Updated 6 months ago
- A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access. ☆2,316 Apr 20, 2026 Updated last week
- Low-code/no-code python library that transforms plain English instructions into fully configured multi-agent AI teams ☆192 Apr 10, 2026 Updated 2 weeks ago
- 🔥 The API to search, scrape, and interact with the web for AI ☆112,116 Updated this week
- 100+ AI Agent & RAG apps you can actually run — clone, customize, ship. ☆107,099 Apr 19, 2026 Updated last week
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost … ☆26,069 Apr 22, 2026 Updated last week
- Swiss-army tool for scraping and extracting data from online assets, made for hackers ☆4,664 Oct 12, 2024 Updated last year
- Supercharge Your LLM with the Fastest KV Cache Layer ☆8,132 Updated this week
- MPC Server for PySpark inspired by the LakeSail ☆18 Feb 26, 2026 Updated 2 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,353 Feb 21, 2025 Updated last year
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google. ☆20 Jun 28, 2025 Updated 10 months ago