HamzaElshafie / gpt-oss-20B
A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with clamping and residual connection, Mixture-of-Experts (MoE), Self-Attention with learned sinks, banded attention, GQA, and KV-cache.
☆209 · Dec 2, 2025 · Updated 2 months ago
Alternatives and similar repositories for gpt-oss-20B
Users who are interested in gpt-oss-20B are comparing it to the libraries listed below.
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 · Dec 21, 2021 · Updated 4 years ago
- Neural Arithmetic Logic Units by Trask et al. ☆12 · Apr 10, 2019 · Updated 6 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow. ☆15 · Sep 29, 2021 · Updated 4 years ago
- ☆16 · Jun 20, 2023 · Updated 2 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow. ☆20 · Aug 4, 2021 · Updated 4 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML. ☆21 · Oct 26, 2021 · Updated 4 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker containers. ☆20 · Apr 28, 2021 · Updated 4 years ago
- ☆24 · Oct 30, 2019 · Updated 6 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆27 · Oct 20, 2022 · Updated 3 years ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API ☆35 · Sep 15, 2023 · Updated 2 years ago
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow. ☆33 · Mar 23, 2022 · Updated 3 years ago
- Project demonstrating dual model deployment scenarios using Vertex AI (GCP). ☆34 · Dec 28, 2021 · Updated 4 years ago
- Kate is a Multimodal Live Assistant that ignites your browsing experience ☆11 · Feb 15, 2025 · Updated last year
- All Resources from Stanford CS106B 2021 ☆23 · Jul 11, 2025 · Updated 7 months ago
- ☆14 · Jan 9, 2026 · Updated last month
- TensorFlow 2 / Lite implementation of Ultra-Fast Structure-Aware Lane Detection ☆12 · Aug 19, 2020 · Updated 5 years ago
- ☆10 · Apr 7, 2025 · Updated 10 months ago
- A ratatui based vertical and horizontal slider. ☆35 · Jan 7, 2026 · Updated last month
- An Awesome list of AI tools powered by ChatGPT / Whisper and Stable Diffusion or are useful to developers of that domain ☆10 · Jul 26, 2023 · Updated 2 years ago
- FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆24 · Feb 10, 2026 · Updated last week
- ☆10 · Aug 15, 2022 · Updated 3 years ago
- generate spot-it cards ☆10 · Jun 13, 2015 · Updated 10 years ago
- Architecture principles ☆13 · May 23, 2025 · Updated 8 months ago
- Code for the experiments in the ACL 2020 paper "Estimating predictive uncertainty for rumour verification models" ☆11 · May 15, 2020 · Updated 5 years ago
- How to build an ACP compliant agent that uses MCP as well! ☆11 · May 6, 2025 · Updated 9 months ago
- Workshop on Text Classification at 1729 Conference ☆13 · Sep 4, 2022 · Updated 3 years ago
- TypeScript Utils ☆14 · Jan 23, 2018 · Updated 8 years ago
- python port of arc90's readability bookmarklet, updated to match latest readability.js! ☆19 · Sep 13, 2011 · Updated 14 years ago
- SciFin is a python package for Science & Finance. ☆11 · Oct 25, 2020 · Updated 5 years ago
- decontamination ☆24 · Dec 3, 2025 · Updated 2 months ago
- ☆16 · Apr 28, 2023 · Updated 2 years ago
- solutions for advent of code 2018 ☆17 · Dec 19, 2018 · Updated 7 years ago
- ☆12 · Nov 11, 2024 · Updated last year
- ☆11 · Nov 30, 2023 · Updated 2 years ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 6 months ago
- ☆36 · Feb 6, 2026 · Updated last week
- J.A.R.V.I.S is a very advanced virtual assistant who can automate almost all tasks of everything of PC & IoT. Just Say It. ☆11 · Jul 29, 2021 · Updated 4 years ago
- Implementation of ConvMixer-Patches Are All You Need? in TensorFlow and Keras ☆12 · Oct 31, 2021 · Updated 4 years ago
- Tool for migrating MongoDB contents to Solr for indexing written in Ruby ☆17 · Aug 24, 2011 · Updated 14 years ago