safety-research / circuit-tracer
☆2,023, updated 2 weeks ago
Alternatives and similar repositories for circuit-tracer
Users who are interested in circuit-tracer are comparing it to the libraries listed below.
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,162, updated 5 months ago
- Synthetic data curation for post-training and structured data extraction ☆1,414, updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,106, updated 4 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆574, updated last week
- Textbook on reinforcement learning from human feedback ☆1,052, updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆551, updated 3 months ago
- Releases from OpenAI Preparedness ☆783, updated 3 weeks ago
- Sky-T1: Train your own O1 preview model within $450 ☆3,272, updated last month
- LIMO: Less is More for Reasoning ☆963, updated 2 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆2,233, updated 4 months ago
- Recipes to scale inference-time compute of open models ☆1,097, updated last month
- The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search ☆1,354, updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆2,656, updated this week
- Pretraining code for a large-scale depth-recurrent language model ☆783, updated 2 weeks ago
- Verifiers for LLM Reinforcement Learning ☆1,328, updated this week
- Democratizing Reinforcement Learning for LLMs ☆3,396, updated last month
- [ICLR 2025] Automated Design of Agentic Systems ☆1,345, updated 4 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,026, updated 3 weeks ago
- Code and Data for Tau-Bench ☆609, updated 5 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,434, updated 5 months ago
- ☆1,025, updated 6 months ago
- ☆570, updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆737, updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings ☆1,740, updated 2 weeks ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. ☆2,672, updated 2 months ago
- open source interpretability platform 🧠 ☆269, updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,016, updated 3 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents ☆519, updated this week
- Awesome Reasoning LLM Tutorial/Survey/Guide ☆1,781, updated last week
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆1,346, updated 2 weeks ago