safety-research / circuit-tracerLinks
☆991Updated this week
Alternatives and similar repositories for circuit-tracer
Users that are interested in circuit-tracer are comparing it to the libraries listed below
Sorting:
- Textbook on reinforcement learning from human feedback☆938Updated this week
- Verifiers for LLM Reinforcement Learning☆1,057Updated this week
- procedural reasoning datasets☆625Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,135Updated 4 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆349Updated this week
- Releases from OpenAI Preparedness☆761Updated this week
- OO for LLMs☆779Updated this week
- An agent benchmark with tasks in a simulated software company.☆370Updated 2 weeks ago
- Tool for generating high quality Synthetic datasets☆896Updated this week
- Testing baseline LLMs performance across various models☆268Updated this week
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆464Updated last week
- Dream 7B, a large diffusion language model☆703Updated 3 weeks ago
- Recipes to scale inference-time compute of open models☆1,087Updated last week
- open source interpretability platform 🧠☆131Updated this week
- End-to-end Generative Optimization for AI Agents☆575Updated last week
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆757Updated last week
- Agent Reinforcement Trainer for training multi-turn agents using GRPO☆628Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆499Updated 3 weeks ago
- Synthetic data curation for post-training and structured data extraction☆1,364Updated this week
- Pretraining code for a large-scale depth-recurrent language model☆770Updated this week
- CodeScientist: An automated scientific discovery system for code-based experiments☆263Updated 2 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆530Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆956Updated last week
- Code for BLT research paper☆1,664Updated last week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆728Updated 2 weeks ago
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.☆360Updated this week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆309Updated 7 months ago
- System 2 Reasoning Link Collection☆835Updated 2 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆315Updated this week
- Fully open data curation for reasoning models☆1,796Updated last week