logikon-ai / logikon
Analyzing and scoring reasoning traces of LLMs
☆41 Updated 4 months ago
Alternatives and similar repositories for logikon:
Users who are interested in logikon are comparing it to the libraries listed below:
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI". ☆31 Updated 11 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆31 Updated 11 months ago
- Reasoning by Communicating with Agents ☆23 Updated 3 months ago
- Sphynx Hallucination Induction ☆51 Updated 5 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆35 Updated 8 months ago
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- Factored Cognition Primer: How to write compositional language model programs ☆48 Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆51 Updated this week
- AI Evaluation Platform ☆45 Updated this week
- Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40% ☆15 Updated last year
- LLM finetuning ☆43 Updated last year
- An attribution library for LLMs ☆35 Updated 4 months ago
- A set of utilities for running few-shot prompting experiments on large-language models ☆116 Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆47 Updated last month
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆37 Updated 10 months ago
- Code interpreter support for o1 ☆32 Updated 4 months ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 2 months ago
- Track the progress of LLM context utilisation ☆53 Updated 6 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆20 Updated 2 weeks ago
- Based on the tree of thoughts paper ☆46 Updated last year
- Agent computer interface for AI software engineer. ☆22 Updated this week
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆46 Updated 7 months ago
- LMQL implementation of tree of thoughts ☆33 Updated 11 months ago
- ☆52 Updated this week
- Query language for blending SQL logic and LLM reasoning across structured + unstructured data. [Findings of ACL 2024] ☆82 Updated 2 months ago
- ☆38 Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆79 Updated 10 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆62 Updated last month
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 3 years ago