safety-research / circuit-tracerLinks
β2,335Updated 2 weeks ago
Alternatives and similar repositories for circuit-tracer
Users that are interested in circuit-tracer are comparing it to the libraries listed below
Sorting:
- Textbook on reinforcement learning from human feedbackβ1,221Updated this week
- open source interpretability platform π§β398Updated this week
- Verifiers for LLM Reinforcement Learningβ3,057Updated this week
- Large Concept Models: Language modeling in a sentence representation spaceβ2,280Updated 7 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineeringβ957Updated this week
- Releases from OpenAI Preparednessβ860Updated 3 weeks ago
- β1,233Updated this week
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,265Updated last month
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the inputβ859Updated 3 months ago
- Synthetic data curation for post-training and structured data extractionβ1,500Updated last month
- Recipes to scale inference-time compute of open modelsβ1,110Updated 3 months ago
- Darwin GΓΆdel Machine: Open-Ended Evolution of Self-Improving Agentsβ1,656Updated last month
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,145Updated 7 months ago
- procedural reasoning datasetsβ1,102Updated this week
- Renderer for the harmony response format to be used with gpt-ossβ3,774Updated last month
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Eβ¦β1,436Updated 2 months ago
- Self-Adapting Language Modelsβ785Updated last month
- Democratizing Reinforcement Learning for LLMsβ4,177Updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"β596Updated 6 months ago
- Code for BLT research paperβ1,983Updated 3 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,568Updated 8 months ago
- [COLM 2025] LIMO: Less is More for Reasoningβ1,018Updated last month
- Atom of Thoughts for Markov LLM Test-Time Scalingβ586Updated 3 months ago
- Code and Data for Tau-Benchβ834Updated 3 weeks ago
- A benchmark for LLMs on complicated tasks in the terminalβ691Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.β2,919Updated last month
- Pretraining and inference code for a large-scale depth-recurrent language modelβ827Updated last week
- Tool for generating high quality Synthetic datasetsβ1,183Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,890Updated last week
- Humanity's Last Examβ1,098Updated last month