A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with clamping and residual connection, Mixture-of-Experts (MoE), Self-Attention with learned sinks, banded attention, GQA, and KV-cache.
☆231Dec 2, 2025Updated 4 months ago
Alternatives and similar repositories for gpt-oss-20B
Users that are interested in gpt-oss-20B are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llm201n: neural networks zero to super hero. the bridge from mirograd to tinygrad!☆71Apr 22, 2026Updated last week
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- Testing OpenAI Universe☆14Dec 10, 2016Updated 9 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Aug 4, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Neural Arithmetic Logic Units by Trask et al.☆12Apr 10, 2019Updated 7 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆20Oct 26, 2021Updated 4 years ago
- turn small javascript functions into GPT function calls☆12Aug 23, 2023Updated 2 years ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- PyTorch implementation of Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation☆32Dec 29, 2021Updated 4 years ago
- Antenna analyzer based on RigExpert Zero II and Arduino☆13Jan 25, 2024Updated 2 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 5 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Oct 20, 2022Updated 3 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Supporting code for the blog post on modular manifolds.☆121Sep 26, 2025Updated 7 months ago
- ☆128Dec 9, 2025Updated 4 months ago
- survery of small language models☆18Jul 23, 2024Updated last year
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆37Sep 15, 2023Updated 2 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- a simple implementation of self attention layer that outputs flattened sentence embedding matrix, with the Frobenius norm penalty☆16Sep 14, 2018Updated 7 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30Apr 21, 2026Updated last week
- SciFin is a python package for Science & Finance.☆11Oct 25, 2020Updated 5 years ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆41Mar 16, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Mar 3, 2025Updated last year
- A simple, generic, and flexible keyframe animation library for Rust.☆30Mar 27, 2026Updated last month
- Solidity contracts for the decentralized Prime Network protocol☆26Jul 6, 2025Updated 9 months ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 7 months ago
- A small logging proxy server for intercepting and logging code completion requests from copilot.☆13May 5, 2023Updated 2 years ago
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 8 months ago
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length.☆61Mar 28, 2026Updated last month
- PySOM - The Simple Object Machine Smalltalk implemented in Python☆19Aug 19, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- 在 Mirai Console 中使用MCL管理包和其他高级功能☆10Nov 13, 2022Updated 3 years ago
- Simple orchestration for EC2 spot containers☆19Sep 27, 2024Updated last year
- A multimodal live AI assistant designed to enhance the browsing experience using Gemini.☆11Feb 15, 2025Updated last year
- A simple Angular2 application with Loopback, which has some basic elements like authentication configured☆12Jan 29, 2017Updated 9 years ago
- TypeScript Utils☆14Jan 23, 2018Updated 8 years ago
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆67Apr 9, 2026Updated 2 weeks ago