Peiyang-Song/Awesome-LLM-Reasoning-Failures

Repo for "Large Language Model Reasoning Failures"

☆204

Alternatives and similar repositories for Awesome-LLM-Reasoning-Failures

Users who are interested in Awesome-LLM-Reasoning-Failures are comparing it to the repositories listed below.

nerdslab / EIT
PyTorch implementation of "Seeing the forest and the tree: Building representations of both individual and collective dynamics with trans…
☆14 • Jan 4, 2023 • Updated 3 years ago
lyan62 / vlm-info-loss
☆22 • Sep 16, 2025 • Updated 9 months ago
boxabirds / claudit
Uses conversation history to audit important decisions and changes.
☆18 • Jul 13, 2025 • Updated 11 months ago
fraimondo / cudaica
☆14 • Mar 10, 2020 • Updated 6 years ago
mne-tools / mne-denoise
mne-denoise provides narrow-band artefact removal tailored to MNE-Python workflows. It wraps harmonic regression techniques to suppress p…
☆29 • Updated this week
dan1t0 / binaryninja-ollama-plus
Enhanced version of binaryninja-ollama that does not use the ollama Python library
☆13 • Jan 23, 2025 • Updated last year
eth-lre / LLM_ICL
ACL24
☆11 • Jun 7, 2024 • Updated 2 years ago
tjnull / cygor
A modular asset discovery framework written in Python to automate repetitive manual work
☆78 • Jun 21, 2026 • Updated last week
nishadsinghi / sc-genrm-scaling
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15 • Oct 31, 2025 • Updated 7 months ago
kdu4108 / semiring-backprop-exps
☆16 • Jul 10, 2023 • Updated 2 years ago
UCSB-NLP-Chang / Prereq_tune
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆11 • Jan 10, 2025 • Updated last year
patrickmineault / ml-theory-reading-list
An annotated reference list of ML theory
☆36 • May 25, 2023 • Updated 3 years ago
splintersfury / AutoPiff
Semantic analysis engine for detecting vulnerability fixes in Windows kernel driver patches — 58 YAML rules, Ghidra decompilation, reacha…
☆63 • Feb 26, 2026 • Updated 4 months ago
wangcunxiang / Graph-aS-Tokens
☆10 • Nov 29, 2024 • Updated last year
kmkrofficial / LiteGPT
LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.
☆35 • Dec 16, 2025 • Updated 6 months ago
penghao-wu / visual_jigsaw
☆78 • Apr 9, 2026 • Updated 2 months ago
JingWu321 / EraseDiff
EraseDiff: Erasing Data Influence in Diffusion Models
☆14 • Nov 20, 2024 • Updated last year
yale-nlp / refdpo
☆16 • Jul 23, 2024 • Updated last year
mmunir127 / LogViG-Official
☆17 • Oct 17, 2025 • Updated 8 months ago
AtvikSecurity / CentralizedPotatoes
A centralized list of the various Potato Windows exploits.
☆24 • Jun 23, 2026 • Updated last week
kkelchte / task_free_continual_learning
This repository demonstrates the application of our proposed task-free continual learning method on a synthetic experiment.
☆13 • Jun 24, 2019 • Updated 7 years ago
d223302 / Over-Reasoning-of-LLMs
Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models
☆11 • Jan 23, 2024 • Updated 2 years ago
mengcaopku / Continual-LLaVA
☆16 • Nov 12, 2024 • Updated last year
openagentidentityprotocol / agentidentityprotocol
Agent Identity Protocol - Zero-trust security layer for AI agents. Policy enforcement proxy for MCP with Human-in-the-Loop approval, DLP …
☆34 • Mar 5, 2026 • Updated 3 months ago
solyarisoftware / prompter.vim
Vim as a perfect playground for large language model prompts
☆20 • Nov 29, 2023 • Updated 2 years ago
zjunlp / knowledge-rumination
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
☆18 • Jun 29, 2023 • Updated 3 years ago
levi-katarok / simplified-rag
Simplifying RAG with PostgreSQL and PGVector
☆16 • Jul 31, 2024 • Updated last year
ALT-JS / OthelloSAE
CS194-196 Course Project
☆14 • Feb 20, 2025 • Updated last year
zhao-zilong / ssc-cot
Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"
☆12 • Nov 26, 2024 • Updated last year
Babelscape / LLM-Oasis
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…
☆25 • Oct 15, 2025 • Updated 8 months ago
m4lvin / tablean
Tableau for basic modal logic in Lean 3 - This is OLD and not maintained. See https://github.com/m4lvin/lean4-pdl instead.
☆13 • Oct 24, 2023 • Updated 2 years ago
arvillion / ActiveVLN
Official implementation of the paper: "ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation"
☆69 • Feb 11, 2026 • Updated 4 months ago
go-nlp / tfidf
tfidf provides TF-IDF functionality
☆14 • Nov 4, 2023 • Updated 2 years ago
liuchengwucn / Safe
(ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…
☆21 • Dec 26, 2025 • Updated 6 months ago
atfortes / LLMSymbolicReasoningBench
Synthetic data generation for evaluating LLM symbolic and logic reasoning
☆23 • Mar 6, 2026 • Updated 3 months ago
U-C4N / Deepseek-CoT
Deepseek-CoT
☆10 • Oct 6, 2024 • Updated last year
kaistAI / knowledge-reasoning
[EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…
☆23 • Dec 4, 2024 • Updated last year
leloykun / mmsg
Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.
☆29 • Oct 18, 2024 • Updated last year
Aloriosa / srmt
The original Shared Recurrent Memory Transformer implementation
☆36 • Jul 11, 2025 • Updated 11 months ago