monostate / weave-logprobs-reasoning-loop
A notebook that compares a reasoning model with a non-reasoning model that runs a loop driven by uncertainty detected from logprobs
☆25Updated 3 months ago
Alternatives and similar repositories for weave-logprobs-reasoning-loop
Users who are interested in weave-logprobs-reasoning-loop are comparing it to the libraries listed below
- Pivotal Token Search☆132Updated last week
- PILF: An IPWT-inspired bionic continual learning experiment focused on mitigating catastrophic forgetting with Surprise-gated Mixture of Exper…☆36Updated 4 months ago
- Fast Diversification for Search & Retrieval☆432Updated 3 weeks ago
- Chat strategies for LLMs☆122Updated this week
- ☆88Updated last month
- Build data processing and data analysis pipelines that leverage the power of LLMs 🧠☆242Updated last week
- Pixelagent — Multimodal stateful agents☆223Updated 6 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆107Updated last week
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget.☆322Updated last month
- Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes☆689Updated this week
- Hierarchical Navigable Small Worlds☆101Updated 4 months ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Updated 2 months ago
- An introduction to DSPy☆32Updated 3 months ago
- A library for building software agents using behavior trees and language models.☆90Updated 10 months ago
- An AI agent library using Python as the common language to define executable actions and tool interfaces.☆113Updated last month
- Securely run AI-generated code in stateful sandboxes that run forever.☆224Updated 7 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆223Updated 2 weeks ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 7 months ago
- Open source local sandboxing for running AI generated code.☆244Updated last week
- Your AI research assistant☆79Updated 8 months ago
- ☆117Updated 4 months ago
- Parallel thinking for LLMs. Confidence‑gated, strategy‑driven, offline‑friendly☆259Updated 2 months ago
- Physical AI Assistant that illuminates your life☆189Updated 2 months ago
- Visual inference exploration & experimentation playground☆96Updated last year
- ~ streaming agents☆74Updated this week
- ☆35Updated 4 months ago
- This repo is for the demonstration of TSCE principles.☆32Updated 3 weeks ago
- Extremely memory-efficient vector database☆76Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 4 months ago
- Sculpt: Structuring unstructured data with LLMs☆38Updated 2 months ago