Repo for "Large Language Model Reasoning Failures"
☆204Jun 16, 2026Updated last week
Alternatives and similar repositories for Awesome-LLM-Reasoning-Failures
Users that are interested in Awesome-LLM-Reasoning-Failures are comparing it to the libraries listed below.
- PyTorch implementation of "Seeing the forest and the tree: Building representations of both individual and collective dynamics with trans…☆14Jan 4, 2023Updated 3 years ago
- ☆22Sep 16, 2025Updated 9 months ago
- Uses conversation history to audit important decisions and changes.☆18Jul 13, 2025Updated 11 months ago
- ☆14Mar 10, 2020Updated 6 years ago
- mne-denoise provides narrow-band artefact removal tailored to MNE-Python workflows. It wraps harmonic regression techniques to suppress p…☆29Updated this week
- Enhanced version of binaryninja-ollama that does not use the ollama Python library☆13Jan 23, 2025Updated last year
- ACL24☆11Jun 7, 2024Updated 2 years ago
- A modular asset discovery framework written in Python to automate repetitive manual work☆78Jun 21, 2026Updated last week
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 7 months ago
- ☆16Jul 10, 2023Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- An annotated reference list of ML theory☆36May 25, 2023Updated 3 years ago
- Semantic analysis engine for detecting vulnerability fixes in Windows kernel driver patches — 58 YAML rules, Ghidra decompilation, reacha…☆63Feb 26, 2026Updated 4 months ago
- ☆10Nov 29, 2024Updated last year
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆35Dec 16, 2025Updated 6 months ago
- ☆78Apr 9, 2026Updated 2 months ago
- EraseDiff: Erasing Data Influence in Diffusion Models☆14Nov 20, 2024Updated last year
- ☆16Jul 23, 2024Updated last year
- ☆17Oct 17, 2025Updated 8 months ago
- A centralized list of the various Potato Windows exploits.☆24Jun 23, 2026Updated last week
- This repository demonstrates the application of our proposed task-free continual learning method on a synthetic experiment.☆13Jun 24, 2019Updated 7 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- ☆16Nov 12, 2024Updated last year
- Agent Identity Protocol - Zero-trust security layer for AI agents. Policy enforcement proxy for MCP with Human-in-the-Loop approval, DLP …☆34Mar 5, 2026Updated 3 months ago
- vim as a perfect large language models prompts playground☆20Nov 29, 2023Updated 2 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆18Jun 29, 2023Updated 3 years ago
- Simplifying RAG with PostgreSQL and PGVector☆16Jul 31, 2024Updated last year
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆25Oct 15, 2025Updated 8 months ago
- Tableau for basic modal logic in Lean 3 - This is OLD and not maintained. See https://github.com/m4lvin/lean4-pdl instead.☆13Oct 24, 2023Updated 2 years ago
- Official implementation of the paper: "ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation"☆69Feb 11, 2026Updated 4 months ago
- tfidf provides TF-IDF functionality☆14Nov 4, 2023Updated 2 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆21Dec 26, 2025Updated 6 months ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆23Mar 6, 2026Updated 3 months ago
- Deepseek-CoT☆10Oct 6, 2024Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Oct 18, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 11 months ago