Runs 405B LLMs on 8GB VRAM
☆3,022 · Apr 2, 2026 · Updated 2 months ago
Alternatives and similar repositories for airllm
Users who are interested in airllm are comparing it to the libraries listed below.
- 100% in-browser, hands-free AI voice chat using Whisper, WebLLM, and Supertonic TTS · ☆185 · Dec 11, 2025 · Updated 6 months ago
- AirLLM 70B inference with single 4GB GPU · ☆21,679 · Updated this week
- [MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on … · ☆12,615 · Updated this week
- Open-source context retrieval layer for AI agents · ☆6,460 · Jun 5, 2026 · Updated 3 weeks ago
- 💻 InfiniteGPU is a platform that enables effortless exchange of compute resources for AI workloads · ☆82 · May 31, 2026 · Updated last month
- Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictab… · ☆18,143 · Jun 24, 2026 · Updated last week
- Official inference framework for 1-bit LLMs · ☆39,469 · Mar 10, 2026 · Updated 3 months ago
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG · ☆33,405 · Updated this week
- 🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl! · ☆66,351 · Updated this week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste… · ☆5,107 · Updated this week
- Open Source AI Platform - AI Chat with advanced features that works with every LLM · ☆295 · Jun 24, 2026 · Updated last week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce. · ☆28,866 · Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval… · ☆15,701 · May 27, 2026 · Updated last month
- Multi Face Recognition and Detection · ☆68 · Nov 1, 2022 · Updated 3 years ago
- Forge Orchestrator: Multi-AI task orchestration. File locking, knowledge capture, drift detection. Rust.