From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.
☆108Nov 5, 2025Updated 5 months ago
Alternatives and similar repositories for GPT-OSS
Users that are interested in GPT-OSS are comparing it to the libraries listed below.
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆34Updated this week
- FastAPI + MLX offline-first voice agent with <1s latency. Minimal UI☆53Oct 21, 2025Updated 5 months ago
- ✅ Iterative Transparent Reasoning System by chonkyDB ✅ combining reasoning, graph and vector for trustworthy, explainable and smart LLMs …☆37Jun 13, 2025Updated 10 months ago
- Production-grade OpenClaw personal assistant setup. Security-hardened, 15+ custom tools, Purple-Team audited. Templates & architecture do…☆75Mar 25, 2026Updated 3 weeks ago
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Aug 6, 2025Updated 8 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆50Oct 29, 2025Updated 5 months ago
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- Surgically de-slop LLMs☆14Jun 1, 2025Updated 10 months ago
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆27Jul 26, 2025Updated 8 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆23Updated this week
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Oct 24, 2024Updated last year
- ScribePal is an Open Source intelligent browser extension that leverages AI to empower your web experience by providing contextual insigh…☆22Apr 6, 2026Updated last week
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- WASM-powered AI agents with in-browser LLMs☆24Jan 5, 2026Updated 3 months ago
- ☆23Sep 27, 2024Updated last year
- A demonstration of metadata generation for RAG using a Health Canada document☆19Jan 19, 2025Updated last year
- A Simple, Explainable Vision Language Model for detecting manufacturing defects in products☆14Sep 23, 2025Updated 6 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆167Jul 5, 2025Updated 9 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- ☆51Oct 1, 2025Updated 6 months ago
- ☆31Mar 18, 2026Updated 3 weeks ago
- ☆15Mar 18, 2026Updated 3 weeks ago
- Let's have some retro gaming fun with AI! Join the discord: https://discord.gg/5xXzkMu8Zk☆76Nov 19, 2025Updated 4 months ago
- MCP server to give every agent an ephemeral Linux sandbox for executing shell commands.☆37Mar 3, 2026Updated last month
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 3 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆93Updated this week
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆23Aug 5, 2025Updated 8 months ago
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆32Jan 23, 2026Updated 2 months ago
- The heart of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Run and train Transformer-based Large Language Models (LLMs) natively in .NET using TorchSharp☆24Nov 8, 2024Updated last year
- KGet is a modern, lightweight download manager written in Rust for fast and reliable file downloads from the command line and native app …☆36Mar 2, 2026Updated last month
- a SplineCamera react component☆14Feb 18, 2024Updated 2 years ago
- BitTorrent client written in Rust☆23Nov 3, 2025Updated 5 months ago
- Implementation of the Hierarchical Reasoning Model (HRM), applied to a pathfinding task, plus performance study.☆32Sep 9, 2025Updated 7 months ago
- ☆58Feb 8, 2026Updated 2 months ago
- An open source deep learning library for Unity.☆17Mar 15, 2026Updated last month
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 6 months ago
- A dynamic multi-expert AI architecture running on a single consumer GPU (RTX 3060).☆36Dec 2, 2025Updated 4 months ago