exploring whether LLMs perform case-based or rule-based reasoning
☆30Mar 2, 2024Updated last year
Alternatives and similar repositories for Case_or_Rule
Users that are interested in Case_or_Rule are comparing it to the libraries listed below
Sorting:
- ☆19Sep 16, 2025Updated 5 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- [LREC-Coling 2024] PECC: Problem Extraction and Coding Challenges☆14May 30, 2024Updated last year
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆30Jan 29, 2026Updated last month
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆47Jun 3, 2024Updated last year
- [NeurIPS 2022] "Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks"☆13Nov 11, 2022Updated 3 years ago
- [ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries☆31May 29, 2024Updated last year
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆19Mar 31, 2025Updated 11 months ago
- ☆16Nov 26, 2024Updated last year
- The official codes of Rethinking Knowledge Graph Evaluation Under the Open-World Assumption (NeurIPS 2022)☆22Sep 20, 2022Updated 3 years ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated 9 months ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25May 29, 2025Updated 9 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆23Apr 26, 2025Updated 10 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆47Aug 13, 2025Updated 6 months ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024☆27Nov 13, 2024Updated last year
- [NeurIPS 2023] Does Invariant Graph Learning via Environment Augmentation Learn Invariance?☆22Aug 25, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- A library for subgraph GNN based on pyg☆39Nov 28, 2024Updated last year
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆26Mar 10, 2025Updated 11 months ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆61Feb 20, 2024Updated 2 years ago
- ☆83Sep 5, 2024Updated last year
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆24Feb 1, 2024Updated 2 years ago
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆24Nov 18, 2023Updated 2 years ago
- [NeurIPS 2024] HonestLLM: Toward an Honest and Helpful Large Language Model☆29Jun 10, 2025Updated 8 months ago
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- [CIKM'2024] "RecDiff: Diffusion Model for Social Recommendation"☆87Jun 16, 2025Updated 8 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- [NeurIPS 2023] Understanding and Improving Feature Learning for Out-of-Distribution Generalization☆29May 27, 2025Updated 9 months ago
- Concurrency library☆17Oct 13, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- ☆11May 25, 2023Updated 2 years ago
- [SIGGRAPH Asia 2024] Painting process generating using diffusion models☆94Nov 12, 2025Updated 3 months ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆39Mar 30, 2025Updated 11 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Mar 26, 2024Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆35Mar 19, 2024Updated last year