openai / preparedness
Releases from OpenAI Preparedness
☆783 Updated 3 weeks ago
Alternatives and similar repositories for preparedness
Users who are interested in preparedness are comparing it to the repositories listed below.
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆551 Updated 3 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆574 Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,162 Updated 5 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆489 Updated last month
- Pretraining code for a large-scale depth-recurrent language model ☆783 Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learning ☆1,328 Updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,016 Updated 3 weeks ago
- TTRL: Test-Time Reinforcement Learning ☆650 Updated 2 weeks ago
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E… ☆1,408 Updated last month
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆519 Updated this week
- Repository for Zochi's Research ☆221 Updated 3 weeks ago
- ☆2,023 Updated 2 weeks ago
- Self-Adapting Language Models ☆430 Updated last week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆760 Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,106 Updated 4 months ago
- procedural reasoning datasets ☆872 Updated this week
- CodeScientist: An automated scientific discovery system for code-based experiments ☆271 Updated 2 months ago
- ☆570 Updated 2 months ago
- LIMO: Less is More for Reasoning ☆963 Updated 2 months ago
- Dream 7B, a large diffusion language model ☆774 Updated last week
- Muon is Scalable for LLM Training ☆1,081 Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆991 Updated last month
- Official Repo for Open-Reasoner-Zero ☆1,969 Updated 3 weeks ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example” ☆290 Updated last week
- An agent benchmark with tasks in a simulated software company. ☆397 Updated last week
- Automatic evals for LLMs ☆437 Updated 2 weeks ago
- The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search ☆1,354 Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆506 Updated 2 weeks ago
- Synthetic data curation for post-training and structured data extraction ☆1,414 Updated last week
- An Open Large Reasoning Model for Real-World Solutions ☆1,498 Updated 3 weeks ago