Running a big model on a small laptop
☆3,919Mar 19, 2026Updated 3 months ago
Alternatives and similar repositories for flash-moe
Users that are interested in flash-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Running a big model on a small laptop☆44Mar 26, 2026Updated 2 months ago
- PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal ac…☆296Jun 5, 2026Updated 2 weeks ago
- World's first Nintendo 3DS emulator for Apple devices based on Citra.☆18Apr 7, 2023Updated 3 years ago
- A sine/square waveform generator using Teensy 3.1☆11May 19, 2015Updated 11 years ago
- [AAAI26]: DS SERVE: The Largest Open Vector Store over Pretain Data; A Framework for Efficient and Scalable Neural Retrieval☆51Jan 28, 2026Updated 4 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Sim2Reason: Solving Physics Olympiad via Reinforcement Learning on Physics Simulators. We present a method for turning physics simulator…☆165Apr 25, 2026Updated last month
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzu☆22Jun 30, 2025Updated 11 months ago
- Pitcher mechanics analyzer: single-camera video → 3D biomechanical analysis using SAM 3D Body. Built for MLB player development.☆50Apr 9, 2026Updated 2 months ago
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + …☆56Mar 7, 2026Updated 3 months ago
- Convert StableHLO models into Apple Core ML format☆22Jun 12, 2026Updated last week
- Electron shell for mrmd - Zen Markdown Editor with real-time collaboration☆37Mar 20, 2026Updated 2 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆81Feb 27, 2026Updated 3 months ago
- CBOR library for Arduino☆18Jul 26, 2025Updated 10 months ago
- This repository serves as a central hub for discovering tools and services focused on automated prompt engineering. Whether you're lookin…☆16Oct 11, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 9 months ago
- Seo Crawler Saas Open source☆60Jun 9, 2026Updated last week
- Hundreds of models & providers. One command to find what runs on your hardware.☆27,858Updated this week
- Live view of Claude Code sessions and the ability to search them☆87Dec 30, 2025Updated 5 months ago
- OpenClaw Supermemory lets to have long-term memory and recall for your openclaw agent.☆793Updated this week
- Run Time Series Foundation Models on Apple Silicon☆33Feb 27, 2026Updated 3 months ago
- AirLLM 70B inference with single 4GB GPU☆19,933Mar 10, 2026Updated 3 months ago
- 🐙 A curated set of Codex and OpenClaw skills for workflow automation, technical debugging, and agent-assisted development patterns.☆86May 27, 2026Updated 3 weeks ago
- AI agents running research on single-GPU nanochat training automatically☆86,596Mar 26, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 2023 Advent of Code in Ada☆37Oct 10, 2025Updated 8 months ago
- Native Excalidraw diagram preview tool for pi — draw and save diagrams from the agent with a live glimpse webview.☆62May 3, 2026Updated last month
- The ArtificialSongGenerator automatically composes and compiles the Artifical Audio Multitrack dataset (AAM).☆27Nov 17, 2025Updated 7 months ago
- This repository is an implementation of virtually trying on different outfits by image editing and inpainting using Imagen 3 model.☆25Apr 4, 2025Updated last year
- Run frontier AI locally.☆45,365Updated this week
- a simple hl7-parser☆13Feb 13, 2025Updated last year
- AI-powered penetration testing assistant using local LLM on linux (Parrot OS)☆3,064Apr 11, 2026Updated 2 months ago
- agent skills, starting w/ context engineering☆79May 12, 2026Updated last month
- Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG☆1,523Mar 16, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Open-source AI coworker, with memory☆14,959Jun 12, 2026Updated last week
- Deterministic browser automation. Works out of the box with Claude/Codex/OpenCode☆474Jun 6, 2026Updated last week
- Find out why your CoreML model isn't running on the Neural Engine!☆30Jun 18, 2024Updated 2 years ago
- Claude Code agent that routes to external LLMs (Grok, Gemini, GPT-5, etc.) via OpenRouter - just mention the model name☆31Nov 30, 2025Updated 6 months ago
- Stash — persistent memory layer for AI agents. Episodes, facts, and working context stored in Postgres. MCP server included. Self-hosted,…☆712Updated this week
- ☆190Jun 11, 2026Updated last week
- Tree-based speculative decoding for Apple Silicon (MLX). ~10-15% faster than DFlash on code, ~1.5x over autoregressive. First MLX port wi…☆140Apr 15, 2026Updated 2 months ago