β71Aug 6, 2025Updated 8 months ago
Alternatives and similar repositories for gpt-oss-reverse-engineering
Users that are interested in gpt-oss-reverse-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π[ICLR 2025] TFG-Flow: Training-free Guidance in Multimodal Generative Flowβ19Mar 4, 2025Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"β22Jul 16, 2023Updated 2 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama codeβ10Aug 29, 2023Updated 2 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)β34Aug 6, 2023Updated 2 years ago
- [ICLR26] AI-based scaling law discoveryβ28Jan 30, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Vortex: A Flexible and Efficient Sparse Attention Frameworkβ50Updated this week
- β38Aug 7, 2025Updated 8 months ago
- β34Nov 26, 2025Updated 4 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"β25Dec 12, 2023Updated 2 years ago
- The repository contains code for Adaptive Data Optimizationβ34Dec 9, 2024Updated last year
- Server Usage Documentation of AIRβ23Feb 22, 2023Updated 3 years ago
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"β42Nov 15, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- Course website for Operating System course in Peking University.β14Nov 28, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β18Mar 18, 2024Updated 2 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representationsβ12Sep 4, 2024Updated last year
- Face Recognition on NVIDIA TX2β10Sep 5, 2018Updated 7 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.orβ¦β12May 15, 2024Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"β319Dec 20, 2023Updated 2 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.β14Mar 2, 2024Updated 2 years ago
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)β30Dec 23, 2023Updated 2 years ago
- ζΈ εε€§ε¦βθ·ε‘ι¨θ―Ύε βε©ζοΌε ε«θͺε¨ηΎε°γηι’γηΉεθ―ι³ζιηεθ½γβ39Dec 15, 2025Updated 3 months ago
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025β22Mar 6, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- β15Apr 26, 2025Updated 11 months ago
- FlexAttention w/ FlashAttention3 Supportβ27Oct 5, 2024Updated last year
- β19Oct 30, 2025Updated 5 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curationβ80May 2, 2025Updated 11 months ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)β12Nov 28, 2023Updated 2 years ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolutionβ103Sep 24, 2025Updated 6 months ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systemsβ36Nov 18, 2025Updated 4 months ago
- β87Feb 10, 2026Updated 2 months ago
- β13Feb 18, 2025Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recyclingβ14Sep 27, 2025Updated 6 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open Dβ¦β12Sep 16, 2022Updated 3 years ago
- codes for paper "AttCAT: Explaining Transformers via Attentive Class Activation Tokens"β12May 13, 2024Updated last year
- β109Jul 15, 2025Updated 8 months ago
- β34Apr 8, 2025Updated last year
- Understanding the correlation between different LLM benchmarksβ29Jan 11, 2024Updated 2 years ago
- A simple gitlab/github web hooks daemonβ16Feb 6, 2026Updated 2 months ago