ctlllll/gpt-oss-reverse-engineering

☆72

Alternatives and similar repositories for gpt-oss-reverse-engineering

Users who are interested in gpt-oss-reverse-engineering are comparing it to the repositories listed below.

zhenyuhe00 / SWE-Swiss
SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution
☆105 · Sep 24, 2025 · Updated 10 months ago
lsj2408 / URPE
[NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)
☆35 · Aug 6, 2023 · Updated 2 years ago
linhaowei1 / TFG-Flow
👌[ICLR 2025] TFG-Flow: Training-free Guidance in Multimodal Generative Flow
☆20 · Mar 4, 2025 · Updated last year
zhuzilin / flash-attention-with-sink
☆37 · Aug 7, 2025 · Updated 11 months ago
cherichy / tilecute
☆32 · Jul 2, 2025 · Updated last year
HanGuo97 / hilt
☆40 · Dec 14, 2025 · Updated 7 months ago
zxytim / arithmetic-encoding-compression
☆11 · Apr 3, 2023 · Updated 3 years ago
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆23 · Updated this week
suoych / KEDs
Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
☆20 · Nov 4, 2024 · Updated last year
Chillee / lit-llama
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆10 · Aug 29, 2023 · Updated 2 years ago
li-plus / flash-preference
Accelerate LLM preference tuning via prefix sharing with a single line of code
☆52 · Jul 4, 2025 · Updated last year
guyuntian / CoT_benchmark
Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"
☆21 · Jul 16, 2023 · Updated 3 years ago
ctlllll / understanding_llm_benchmarks
Understanding the correlation between different LLM benchmarks
☆30 · Jan 11, 2024 · Updated 2 years ago
zbh2047 / L_inf-dist-net
[ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.
☆41 · Mar 16, 2022 · Updated 4 years ago
princeton-nlp / ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆261 · Sep 12, 2025 · Updated 10 months ago
kellycyy / daily_dilemmas
☆16 · Aug 23, 2025 · Updated 11 months ago
TiledTensor / TiledBench
Benchmark tests supporting the TiledCUDA library.
☆19 · Nov 19, 2024 · Updated last year
RuntianZ / adversarial-robustness-unlabeled
Adversarially Robust Generalization Just Requires More Unlabeled Data
☆11 · Aug 8, 2019 · Updated 6 years ago
dsl-learn / cutile-learn
NVIDIA cuTile learn
☆169 · Dec 9, 2025 · Updated 7 months ago
Infini-AI-Lab / TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
☆281 · Aug 31, 2024 · Updated last year
oliverYoung2001 / UltraAttn
SC'25 UltraAttn: Efficiently Parallelizing Attention through Hierarchical Context-Tiling
☆16 · Aug 14, 2025 · Updated 11 months ago
tangzhy / RealCritic
☆15 · Jan 27, 2025 · Updated last year
zhenyuhe00 / BiPE
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024
☆24 · Jun 26, 2024 · Updated 2 years ago
AdamG012 / moe-paper-models
A summary of MoE experimental setups across a number of different papers.
☆16 · Feb 16, 2023 · Updated 3 years ago
Anonymous1252022 / fp4-all-the-way
☆52 · May 20, 2025 · Updated last year
feifeibear / ChituAttention
Quantized Attention on GPU
☆45 · Nov 22, 2024 · Updated last year
IST-DASLab / Quartet-II
Quartet II Official Code
☆77 · May 1, 2026 · Updated 2 months ago
LeiWang1999 / Stream-k.tvm
☆20 · Sep 28, 2024 · Updated last year
aikitoria / nanotrace
Low overhead tracing library and trace visualizer for pipelined CUDA kernels
☆137 · Jul 17, 2026 · Updated last week
CMU-SAFARI / DRAM-Datasheet-Survey
A survey of manufacturer-provided DRAM operating parameters and timings as specified by DRAM chip datasheets from between 1970 and 2021. …
☆11 · May 4, 2022 · Updated 4 years ago
linhaowei1 / SLD
[ICLR26] AI-based scaling law discovery
☆31 · Jan 30, 2026 · Updated 5 months ago
haochengxi / Train_Transformers_with_INT4
☆157 · Jun 22, 2023 · Updated 3 years ago
cpldcpu / LRMTokenEconomy
Measuring Thinking Efficiency in Reasoning Models - Research Repository
☆39 · Dec 2, 2025 · Updated 7 months ago
Edward-Sun / TSM-PDE
Code of ICML paper arxiv.org/abs/2302.08105
☆14 · May 4, 2023 · Updated 3 years ago
nanomaoli / llm_reproducibility
☆106 · May 29, 2026 · Updated last month
MadryLab / journey-TRAK
Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"
☆25 · Dec 12, 2023 · Updated 2 years ago
graphdeco-inria / gaussian-hierarchy
☆14 · Jul 17, 2024 · Updated 2 years ago
allenai / olmix
☆41 · May 26, 2026 · Updated last month
Co1lin / AIR-Server-Doc
Server Usage Documentation of AIR
☆22 · Feb 22, 2023 · Updated 3 years ago