Shoalstone / helmLinks
An interface for text continuations emphasizing autonomous exploration and complex tree management
☆24Updated 2 months ago
Alternatives and similar repositories for helm
Users that are interested in helm are comparing it to the libraries listed below
Sorting:
- A Chrome extension that allows you to export your Claude.ai conversations in various formats (JSON, Markdown, Plain Text) with support fo…☆31Updated 3 months ago
- A Loom implementation in Obsidian☆322Updated 10 months ago
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API☆31Updated last year
- a socketteer/loom reimplementation in obsidian☆44Updated last year
- A framework for optimizing DSPy programs with RL☆309Updated 2 weeks ago
- explore token trajectory trees on instruct and base models☆150Updated 8 months ago
- Inference-time scaling for LLMs-as-a-judge.☆327Updated 2 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆457Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆315Updated 7 months ago
- smolLM with Entropix sampler on pytorch☆149Updated last year
- The State Of The Art, intelligence☆157Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆574Updated 3 weeks ago
- Plotting (entropy, varentropy) for small LMs☆99Updated 8 months ago
- LLMProc: Unix-inspired runtime that treats LLMs as processes.☆34Updated 6 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- Parallel Reasoning: llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.☆373Updated 3 weeks ago
- Claude Deep Research config for Claude Code.☆225Updated 10 months ago
- command loom interface☆111Updated 11 months ago
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆127Updated 3 months ago
- Letting Claude Code develop his own MCP tools :)☆123Updated 10 months ago
- Forecastbench Datasets, updated nightly☆22Updated this week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆132Updated 3 weeks ago
- ☆33Updated 7 months ago
- smol models are fun too☆93Updated last year
- Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching"☆179Updated 6 months ago
- A graph visualization of attention☆57Updated 8 months ago
- ⚖️ Awesome LLM Judges ⚖️☆148Updated 9 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆415Updated 4 months ago
- Agent workspace template for gptme☆42Updated this week
- cli loom that uses git to manage branches☆32Updated last year