openai / circuit_sparsityLinks
Open-source release accompanying Gao et al. 2025
☆478Updated 3 weeks ago
Alternatives and similar repositories for circuit_sparsity
Users that are interested in circuit_sparsity are comparing it to the libraries listed below
Sorting:
- Simple & Scalable Pretraining for Neural Architecture Research☆305Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆355Updated 6 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆293Updated 2 months ago
- rl from zero pretrain, can it be done? yes.☆286Updated 3 months ago
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆366Updated last year
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆284Updated last month
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆228Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆174Updated 11 months ago
- An extension of the nanoGPT repository for training small MOE models.☆224Updated 9 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆499Updated last week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆783Updated last week
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆102Updated last week
- ☆365Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆109Updated 10 months ago
- Physics of Language Models, Part 4☆291Updated this week
- Tina: Tiny Reasoning Models via LoRA☆312Updated 3 months ago
- Normalized Transformer (nGPT)☆195Updated last year
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆249Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆85Updated 9 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆122Updated 2 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆335Updated 3 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆136Updated 4 months ago
- Curated collection of community environments☆196Updated 2 weeks ago
- PyTorch building blocks for the OLMo ecosystem☆656Updated this week
- Async RL Training at Scale☆976Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆249Updated last week
- Exploring Applications of GRPO☆251Updated 4 months ago
- RLP: Reinforcement as a Pretraining Objective☆220Updated 3 months ago