safety-research / circuit-tracerLinks
☆2,212Updated this week
Alternatives and similar repositories for circuit-tracer
Users that are interested in circuit-tracer are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆1,690Updated this week
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,550Updated last month
- Synthetic data curation for post-training and structured data extraction☆1,468Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,132Updated 6 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,224Updated 6 months ago
- Textbook on reinforcement learning from human feedback☆1,147Updated 2 weeks ago
- procedural reasoning datasets☆1,012Updated this week
- Releases from OpenAI Preparedness☆815Updated this week
- [ICLR 2025] Automated Design of Agentic Systems☆1,395Updated 6 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆836Updated last month
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆823Updated last month
- Recipes to scale inference-time compute of open models☆1,110Updated 2 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,254Updated 6 months ago
- 🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, …☆1,546Updated this week
- [COLM 2025] LIMO: Less is More for Reasoning☆993Updated last week
- AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.☆972Updated last week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,379Updated 2 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆808Updated 2 weeks ago
- Democratizing Reinforcement Learning for LLMs☆3,962Updated this week
- open source interpretability platform 🧠☆311Updated this week
- Atom of Thoughts for Markov LLM Test-Time Scaling☆580Updated last month
- Self-Adapting Language Models☆743Updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆573Updated 4 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,498Updated 6 months ago
- AlphaGo Moment for Model Architecture Discovery.☆794Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆2,949Updated 3 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆538Updated 2 weeks ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,796Updated last week
- Tool for generating high quality Synthetic datasets☆1,100Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,793Updated this week