logikon-ai/cot-eval

A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.

☆19

Alternatives and similar repositories for cot-eval

Users who are interested in cot-eval are comparing it to the libraries listed below.

huhailinguist / ChineseNLIProbing
☆10 · Oct 17, 2021 · Updated 4 years ago
gabeguo / any-order-speculative-decoding
Reviving Any-Order Autoregressive Models via Principled Parallel Sampling and Speculative Decoding
☆16 · Nov 16, 2025 · Updated 8 months ago
Unified-Language-Model-Alignment / src
☆14 · Oct 7, 2023 · Updated 2 years ago
yakazimir / semantic_fragments
Code and data for experiments on semantic fragments
☆11 · Jun 23, 2022 · Updated 4 years ago
verypluming / HELP
HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)
☆15 · Jul 20, 2023 · Updated 3 years ago
perkinslr / schemepy
Implementation of Scheme in Python supporting call/cc and hygienic macros
☆16 · Sep 12, 2015 · Updated 10 years ago
roeeaharoni / dynmt-py
Neural machine translation implementation using DyNet's Python bindings
☆17 · Jan 24, 2018 · Updated 8 years ago
debatelab / aacorpus
Code for the paper "Critical Thinking for Language Models"
☆13 · Jun 1, 2021 · Updated 5 years ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20 · Apr 11, 2026 · Updated 3 months ago
garrettallen14 / CoT-Reasoning-Without-Prompting
Exploring CoT-Decoding from Google DeepMind's paper, "Chain-of-Thought Reasoning Without Prompting".
☆13 · Feb 22, 2024 · Updated 2 years ago
liangyuRain / ForestColl
☆20 · Jun 1, 2026 · Updated last month
ronentk / dbca-splitter
Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713
☆11 · Nov 25, 2020 · Updated 5 years ago
NUS-HPC-AI-Lab / DyVM
☆18 · Apr 8, 2025 · Updated last year
allenai / CommaQA
Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents
☆24 · May 24, 2022 · Updated 4 years ago
lucadiliello / semantic-loss-pytorch
PyPSDD porting to Python 3 + PyTorch equivalent tree construction.
☆16 · Jun 7, 2023 · Updated 3 years ago
0x9ef / openai-go
OpenAI GPT-3/3.5/4 API client written in Go
☆20 · Apr 13, 2023 · Updated 3 years ago
art-ai / pypsdd
The Python PSDD Package
☆19 · Jul 20, 2025 · Updated last year
siaen / python_finance_course
CEU Python for finance course material
☆22 · Feb 25, 2020 · Updated 6 years ago
yakazimir / esslli_neural_symbolic
Course resources and notes for the ESSLLI 2023 course on neural symbolic methods.
☆18 · Feb 5, 2025 · Updated last year
allenai / recoma
Reasoning by Communicating with Agents
☆30 · Apr 29, 2025 · Updated last year
nttrd-mdlab / wearable-seld-dataset
☆10 · Feb 18, 2022 · Updated 4 years ago
allenai / tracie
☆14 · May 7, 2021 · Updated 5 years ago
NeuroFusionAI / fibo-mcp
Open-source MCP for financial ontology
☆23 · Jul 12, 2026 · Updated last week
allenai / hybrid-preferences
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29 · Jul 23, 2025 · Updated last year
ericmjl / website
Eric Ma's Personal Website
☆18 · Updated this week
orra-dev / agent-fragile-to-prod-guide
Guide: from fragile multi-agent app to prod ready with orra - code and resources.
☆14 · Mar 24, 2025 · Updated last year
I-AdityaGoyal / ML_Algorithms-In_Depth
☆10 · Aug 5, 2023 · Updated 2 years ago
openHacking / bypass-captcha
Use Node.js + Playwright + 2Captcha to bypass the captcha and automatically log in to bilibili.com
☆19 · Apr 6, 2022 · Updated 4 years ago
EduardoGarrido90 / ML_books
This repository will contain links to the most famous ML books that are available online
☆13 · Oct 15, 2024 · Updated last year
RUCAIBox / ChainLM
☆31 · Mar 23, 2024 · Updated 2 years ago
albertotamajo / imagenet1k-coarse-classes
This repository organizes the ImageNet1k dataset into 10 coarse classes, where each class consists of semantically similar image categorie…
☆22 · Dec 11, 2023 · Updated 2 years ago
thomeou / SALSA-Lite
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15 · Dec 3, 2021 · Updated 4 years ago
Aunsiels / CSK
Code for generating Quasimodo, a commonsense knowledge base.
☆20 · Sep 14, 2021 · Updated 4 years ago
BWR-hhh / TFlow
☆19 · May 14, 2026 · Updated 2 months ago
YunusEmreAlps / Icarus
Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)
☆17 · Jan 18, 2024 · Updated 2 years ago
andreasvc / seekaywhy
A probabilistic CKY parser for PCFGs
☆19 · Mar 12, 2014 · Updated 12 years ago
gio54321 / hoare-logic-prover
Proof-of-concept formal verification using Hoare logic
☆21 · Feb 29, 2020 · Updated 6 years ago
infi-coder / infibench-evaluator
The evaluation framework for the InfiCoder-Eval benchmark.
☆21 · Jul 22, 2024 · Updated 2 years ago
allenai / beaker-gantry
Gantry provides an API that streamlines running experiments in Beaker
☆31 · Updated this week