Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆44Jun 6, 2025Updated 10 months ago
Alternatives and similar repositories for ReasoningEval
Users that are interested in ReasoningEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 23, 2024Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- ☆25Jun 17, 2025Updated 9 months ago
- ☆23Dec 17, 2024Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code and updates for the ScoreRS project.☆42Sep 19, 2025Updated 6 months ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Aug 27, 2025Updated 7 months ago
- ACL24☆11Jun 7, 2024Updated last year
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆40Oct 20, 2025Updated 5 months ago
- An extension of CellPyLib, enabling 3d cellular automaton☆13Aug 7, 2022Updated 3 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- ☆16Jul 10, 2023Updated 2 years ago
- A toolkit for dialogue system evaluation via crowdsourcing☆18Apr 25, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- ☆12Aug 6, 2024Updated last year
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea…☆12Mar 29, 2026Updated 2 weeks ago
- Tensorflow implementation of Spatial VAE via Matrix-Variate Normal Distributions☆14May 24, 2017Updated 8 years ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 2 months ago
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Dec 21, 2025Updated 3 months ago
- 深度学习与围棋学习☆16Oct 27, 2021Updated 4 years ago
- ☆48Feb 26, 2025Updated last year
- REverse-Engineered Reasoning for Open-Ended Generation☆94Sep 10, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆29Dec 1, 2025Updated 4 months ago
- Code for LDLForests☆20Oct 4, 2018Updated 7 years ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- DeepTumorVQA benchmark (9262 CT images + 395k QA pairs)☆31Jul 8, 2025Updated 9 months ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Dec 22, 2022Updated 3 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆20Dec 26, 2025Updated 3 months ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated last month
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Feb 4, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School☆12Oct 12, 2024Updated last year
- Repository for the ACL 2024 conference website☆18Feb 3, 2025Updated last year
- Pytorch version of Continuous Language Generative Flow (ACL 2021)☆11Sep 14, 2021Updated 4 years ago
- Code for "Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion"☆12May 26, 2022Updated 3 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- [SDM'23] ML4C: Seeing Causality Through Latent Vicinity☆14Nov 9, 2022Updated 3 years ago