Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆44Jun 6, 2025Updated 9 months ago
Alternatives and similar repositories for ReasoningEval
Users that are interested in ReasoningEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 23, 2024Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆150Oct 10, 2025Updated 5 months ago
- ☆23Dec 17, 2024Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"☆10Aug 16, 2024Updated last year
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- Repo for 2020 EMNLP paper "Conditional Causal Relationships between Emotions and Causes in Texts"☆14Apr 8, 2021Updated 4 years ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Aug 27, 2025Updated 6 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 5 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago
- ☆16Jul 10, 2023Updated 2 years ago
- 最新LLMの一覧を作成します☆20Mar 18, 2026Updated last week
- ☆12Aug 6, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]☆79Jul 1, 2025Updated 8 months ago
- Scaffold for NLP researcher to quickly set up the codebase☆17Mar 25, 2025Updated last year
- ☆14Jul 2, 2024Updated last year
- ☆59Jun 18, 2024Updated last year
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated last month
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Dec 21, 2025Updated 3 months ago
- ☆15Mar 12, 2024Updated 2 years ago
- Code for LDLForests☆20Oct 4, 2018Updated 7 years ago
- REverse-Engineered Reasoning for Open-Ended Generation☆94Sep 10, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers.☆20Sep 2, 2024Updated last year
- Pytorch implementation of EpiFoundation☆25Feb 25, 2025Updated last year
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Jun 29, 2023Updated 2 years ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- DeepTumorVQA benchmark (9262 CT images + 395k QA pairs)☆31Jul 8, 2025Updated 8 months ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Dec 22, 2022Updated 3 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆20Dec 26, 2025Updated 2 months ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 weeks ago
- ☆14May 28, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School☆12Oct 12, 2024Updated last year
- Code for "Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion"☆12May 26, 2022Updated 3 years ago
- Repository for the ACL 2024 conference website☆18Feb 3, 2025Updated last year
- Online NIfTI voxels to triangulated mesh conversion☆24Dec 5, 2022Updated 3 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 11 months ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago