andyk / headlong
A framework for collecting a large human-sourced chain-of-thoughts dataset
☆22 Updated 10 months ago
Alternatives and similar repositories for headlong
Users who are interested in headlong are comparing it to the libraries listed below.
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆37 Updated this week
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆32 Updated 3 months ago
- Pivotal Token Search ☆89 Updated 2 weeks ago
- Access the Cohere Command R family of models ☆37 Updated 2 months ago
- Let Claude control a web browser on your machine. ☆29 Updated 3 months ago
- Minimal example of MCP for parsing llms.txt ☆38 Updated last month
- LLMs sitting on a council together to decide, by consensus, who among them is the best. ☆15 Updated 3 weeks ago
- A structured framework for defining, verifying and certifying AI systems. ☆13 Updated 2 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆36 Updated last year
- Training hybrid models for dummies. ☆21 Updated 4 months ago
- A text-to-SQL prototype on the northwind sqlite dataset ☆12 Updated 8 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆60 Updated 10 months ago
- LLM plugin for models hosted by Anyscale Endpoints ☆33 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆31 Updated last month
- Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words ☆93 Updated this week
- This unique variation on Thinking Claude maps Claude's thought process steps to unicode and forces Claude to think in unicode, potentiall… ☆12 Updated 3 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated 8 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights ☆49 Updated last year
- Run LLMs on Replicate with vLLM ☆17 Updated 7 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 4 months ago
- ☆24 Updated last year
- AI conflict resolution framework designed to work alongside existing AI orchestration tools ☆24 Updated 5 months ago
- ☆16 Updated 7 months ago
- ☆20 Updated 2 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 Updated 4 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration ☆49 Updated 3 months ago
- ☆28 Updated 9 months ago
- GraphRag vs Embeddings ☆13 Updated 10 months ago
- ☆36 Updated 3 months ago
- A Python library that uses Reinforcement Learning (RL) to train LLMs. ☆29 Updated last week