☆72 · Aug 6, 2025 · Updated 10 months ago
Alternatives and similar repositories for gpt-oss-reverse-engineering
Users that are interested in gpt-oss-reverse-engineering are comparing it to the libraries listed below.
- [ICLR 2025] TFG-Flow: Training-free Guidance in Multimodal Generative Flow ☆20 · Mar 4, 2025 · Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆21 · Jul 16, 2023 · Updated 2 years ago
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, … ☆28 · Feb 23, 2025 · Updated last year
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024) ☆20 · Nov 4, 2024 · Updated last year
- ☆11 · Apr 3, 2023 · Updated 3 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 · Aug 29, 2023 · Updated 2 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation) ☆35 · Aug 6, 2023 · Updated 2 years ago
- Adversarially Robust Generalization Just Requires More Unlabeled Data ☆11 · Aug 8, 2019 · Updated 6 years ago
- Vortex: Programmable Sparse Attention for Agents as Algorithm Designers ☆63 · Jun 24, 2026 · Updated last week
- ☆37 · Aug 7, 2025 · Updated 10 months ago
- Code of ICML paper arxiv.org/abs/2302.08105 ☆14 · May 4, 2023 · Updated 3 years ago
- ☆37 · Nov 26, 2025 · Updated 7 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models" ☆25 · Dec 12, 2023 · Updated 2 years ago
- The repository contains code for Adaptive Data Optimization ☆36 · Dec 9, 2024 · Updated last year
- Server Usage Documentation of AIR ☆22 · Feb 22, 2023 · Updated 3 years ago
- ☆32 · Sep 6, 2023 · Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆33 · Jan 23, 2025 · Updated last year
- >>> Exceptions and interrupts + virtual-memory page tables + branch prediction + TLB + Cache + Flash + VGA + uCore ☆20 · Nov 17, 2023 · Updated 2 years ago
- ☆18 · Mar 18, 2024 · Updated 2 years ago
- ☆13 · Feb 12, 2023 · Updated 3 years ago
- Formal Contracts for Multi-Agent Reinforcement Learning ☆20 · Oct 24, 2023 · Updated 2 years ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)" ☆255 · Sep 12, 2025 · Updated 9 months ago
- ☆10 · Jul 13, 2024 · Updated last year
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations ☆12 · Sep 4, 2024 · Updated last year
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or… ☆12 · May 15, 2024 · Updated 2 years ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆324 · Dec 20, 2023 · Updated 2 years ago
- ☆24 · Sep 1, 2025 · Updated 10 months ago
- This repo consists of my implementation of DocFormerV2 ☆12 · Mar 31, 2024 · Updated 2 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch ☆17 · May 30, 2022 · Updated 4 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer. ☆14 · Mar 2, 2024 · Updated 2 years ago
- Accelerate LLM preference tuning via prefix sharing with a single line of code ☆52 · Jul 4, 2025 · Updated 11 months ago
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization. ☆21 · Dec 24, 2024 · Updated last year
- Assistant for Tsinghua University's "荷塘雨课堂" (Hetang Rain Classroom), including automatic sign-in, question answering, and voice alerts for roll calls. ☆41 · Dec 15, 2025 · Updated 6 months ago
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025 ☆25 · Apr 19, 2026 · Updated 2 months ago
- Multi-Level Triton Runner supporting Python, IR, PTX, AMDGCN, cubin and hasco. ☆98 · May 8, 2026 · Updated last month
- ☆12 · Sep 2, 2021 · Updated 4 years ago
- FlexAttention w/ FlashAttention3 Support ☆27 · Oct 5, 2024 · Updated last year
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) ☆12 · Nov 28, 2023 · Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆81 · May 2, 2025 · Updated last year