☆71Aug 6, 2025Updated 9 months ago
Alternatives and similar repositories for gpt-oss-reverse-engineering
Users that are interested in gpt-oss-reverse-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆22Jul 16, 2023Updated 2 years ago
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆28Feb 23, 2025Updated last year
- ☆11Apr 3, 2023Updated 3 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- [ICLR26] AI-based scaling law discovery☆28Jan 30, 2026Updated 3 months ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆53Updated this week
- ☆37Aug 7, 2025Updated 9 months ago
- Code of ICML paper arxiv.org/abs/2302.08105☆14May 4, 2023Updated 3 years ago
- ☆35Nov 26, 2025Updated 5 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆25Dec 12, 2023Updated 2 years ago
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- Server Usage Documentation of AIR☆22Feb 22, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"☆42Nov 15, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- >>> 异常中断 + 虚存页表 + 分支预测 + TLB + Cache + Flash + VGA + uCore☆20Nov 17, 2023Updated 2 years ago
- ☆18Mar 18, 2024Updated 2 years ago
- Formal Contracts for Multi-Agent Reinforcement Learning☆20Oct 24, 2023Updated 2 years ago
- Tools to convert sigsep mus dataset from STEMS <-> WAV☆12Jul 15, 2020Updated 5 years ago
- ☆10Jul 13, 2024Updated last year
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated 2 years ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆321Dec 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.☆42Mar 16, 2022Updated 4 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)☆30Dec 23, 2023Updated 2 years ago
- Accelerate LLM preference tuning via prefix sharing with a single line of code☆52Jul 4, 2025Updated 10 months ago
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆19Dec 24, 2024Updated last year
- 清华大学“荷塘雨课堂”助手,包含自动签到、答题、点名语音提醒等功能。☆41Dec 15, 2025Updated 5 months ago
- The Automotive Urban Traffic Ontology☆15Aug 20, 2024Updated last year
- ☆13Sep 2, 2021Updated 4 years ago
- ☆15Apr 26, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- ☆48Jan 17, 2023Updated 3 years ago
- ☆19Oct 30, 2025Updated 6 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆81May 2, 2025Updated last year
- React file explorer / file browser component☆57Mar 25, 2026Updated last month
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆105Sep 24, 2025Updated 7 months ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 6 months ago