A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with clamping and residual connection, Mixture-of-Experts (MoE), Self-Attention with learned sinks, banded attention, GQA, and KV-cache.
☆233Dec 2, 2025Updated 6 months ago
Alternatives and similar repositories for gpt-oss-20B
Users that are interested in gpt-oss-20B are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- sitp: run nanochat by building teenygrad from scratch: the bridge from micrograd to tinygrad!☆74Updated this week
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Aug 4, 2021Updated 4 years ago
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆20Oct 26, 2021Updated 4 years ago
- ☆17Feb 14, 2024Updated 2 years ago
- PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation☆33Dec 29, 2021Updated 4 years ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- Implementation for MomentumSMoE☆19Apr 19, 2025Updated last year
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 5 years ago
- Confidential inference in enclave for OpenAI grant. Uses k3s and Triton☆16Mar 20, 2025Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆13Jun 7, 2023Updated 3 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- survery of small language models☆18Jul 23, 2024Updated last year
- ☆11Oct 14, 2022Updated 3 years ago
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated last year
- ☆11May 16, 2026Updated 3 weeks ago
- ☆23Oct 30, 2019Updated 6 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆60Aug 4, 2022Updated 3 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆32May 26, 2026Updated 2 weeks ago
- ☆19Mar 3, 2025Updated last year
- Solidity contracts for the decentralized Prime Network protocol☆26Jul 6, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A simple, generic, and flexible keyframe animation library for Rust.☆30Jun 1, 2026Updated last week
- mysql-3.23.49☆11Jun 28, 2014Updated 11 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 1, 2026Updated last week
- A small logging proxy server for intercepting and logging code completion requests from copilot.☆13May 5, 2023Updated 3 years ago
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 9 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆61Feb 7, 2025Updated last year
- A guided lab for MCP security and best practices☆23Updated this week
- Clust_mgr is an important compnent of KunlunBase. It provides a HTTP API for KunlunBase users to do cluster management, provisioning and …☆10Jun 13, 2023Updated 2 years ago
- ☆14Nov 3, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- a single interface around speech-to-speech foundation models☆28Jun 27, 2025Updated 11 months ago
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length.☆66Mar 28, 2026Updated 2 months ago
- Simple orchestration for EC2 spot containers☆19Sep 27, 2024Updated last year
- A multimodal live AI assistant designed to enhance the browsing experience using Gemini.☆11Feb 15, 2025Updated last year
- ☆108May 29, 2025Updated last year
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc…☆24Apr 14, 2025Updated last year