Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆43Jun 6, 2025Updated last year
Alternatives and similar repositories for ReasoningEval
Users that are interested in ReasoningEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 23, 2024Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- [TMLR 2025] Official implementation of AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation☆26Jun 17, 2025Updated 11 months ago
- ☆23Dec 17, 2024Updated last year
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"☆10Aug 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- Repo for 2020 EMNLP paper "Conditional Causal Relationships between Emotions and Causes in Texts"☆14Apr 8, 2021Updated 5 years ago
- A toolkit for dialogue system evaluation via crowdsourcing☆18Apr 25, 2023Updated 3 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆12Aug 6, 2024Updated last year
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]☆81Jul 1, 2025Updated 11 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 4 months ago
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆49Dec 21, 2025Updated 5 months ago
- ☆15Mar 12, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 深度学习与围棋学习☆16Oct 27, 2021Updated 4 years ago
- a simple script to detect word by word plagiarism for https://plagiarism.iu.edu/certificationTests/☆19Feb 22, 2024Updated 2 years ago
- ☆48Feb 26, 2025Updated last year
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 7 months ago
- REverse-Engineered Reasoning for Open-Ended Generation☆97Sep 10, 2025Updated 9 months ago
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers.☆20Sep 2, 2024Updated last year
- Pytorch implementation of EpiFoundation☆27Feb 25, 2025Updated last year
- ☆32Dec 1, 2025Updated 6 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Jun 29, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for LDLForests☆20Oct 4, 2018Updated 7 years ago
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Dec 22, 2022Updated 3 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆21Dec 26, 2025Updated 5 months ago
- ☆13Jan 13, 2025Updated last year
- ☆14May 28, 2023Updated 3 years ago
- Pytorch version of Continuous Language Generative Flow (ACL 2021)☆11Sep 14, 2021Updated 4 years ago
- Code for "Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion"☆12May 26, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- [SDM'23] ML4C: Seeing Causality Through Latent Vicinity☆14Nov 9, 2022Updated 3 years ago
- 计算TFIDF的三种方法:Python、sklearn、gensim☆11Feb 26, 2019Updated 7 years ago
- A very simple library for exploiting graph-of-words in NLP☆12Apr 3, 2021Updated 5 years ago
- DeepTumorVQA benchmark for VLMs and Agents (10k testing samples)☆36May 19, 2026Updated 3 weeks ago
- [WNGT(2019)] On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation☆11Apr 27, 2022Updated 4 years ago
- ☆27Jan 23, 2024Updated 2 years ago