SkyRiver-2000/RuleArena

[ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

☆29

Alternatives and similar repositories for RuleArena

Users that are interested in RuleArena are comparing it to the libraries listed below.

sitaocheng / DERL
The code repo for the paper "Differentiable Evolutionary Reinforcement Learning"
☆18 · Jan 6, 2026 · Updated 6 months ago
facebookresearch / reasoning-memory
Procedural Knowledge at Scale Improves Reasoning. This repository contains the minimal, end-to-end pipeline for reproducing the paper resul…
☆15 · Apr 1, 2026 · Updated 3 months ago
cdhx / MarkQA
Code and data for EMNLP 2023 research track paper "MarkQA: A large scale KBQA dataset with numerical reasoning"
☆13 · Jan 2, 2024 · Updated 2 years ago
duyngtr16061999 / KDMCSE
☆10 · Apr 7, 2024 · Updated 2 years ago
Zhaoyi-Li21 / creme
[ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"
☆14 · Aug 28, 2024 · Updated last year
sunnweiwei / MAIR
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]
☆28 · Nov 3, 2024 · Updated last year
thu-coai / TransferAttack
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
☆19 · May 23, 2025 · Updated last year
cdhx / QDTQA
Code for AAAI 2023 research track paper "Question Decomposition Tree for Answering Complex Questions over Knowledge Bases"
☆17 · Jan 3, 2024 · Updated 2 years ago
RJMillerLab / ModelTables
Official repository for the paper "ModelTables: A Corpus of Tables about Models"
☆16 · Jul 14, 2026 · Updated last week
cezhang01 / Adjacent-Encoder
Source code of the AAAI-2020 paper "Topic Modeling on Document Networks with Adjacent-Encoder"
☆10 · Jul 14, 2020 · Updated 6 years ago
dhh1995 / PromptCoder
See also APPL: https://github.com/appl-team/appl that improves this project. A Python package for writing Language Models prompts in a ne…
☆42 · Oct 2, 2023 · Updated 2 years ago
m3yrin / NTM
Testing of Neural Topic Modeling for Japanese articles
☆13 · Jul 24, 2019 · Updated 7 years ago
HXX97 / GMT-KBQA
Code and data for GMT-KBQA
☆17 · Jan 5, 2023 · Updated 3 years ago
dongxinshuai / RIFT-NeurIPS2021
☆11 · Mar 6, 2022 · Updated 4 years ago
chenlong-clock / RULE-Unlearn
[NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality
☆20 · Oct 22, 2025 · Updated 9 months ago
SkyRiver-2000 / TRAD-Official
[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision
☆20 · Mar 28, 2024 · Updated 2 years ago
bobxwu / CFDTM
Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion (ACL 2024 Findin…
☆16 · Aug 23, 2024 · Updated last year
xiaolin-cs / BackTime
BackTime: Backdoor Attacks on Multivariate Time Series Forecasting
☆32 · Apr 14, 2025 · Updated last year
bengler / propinquity
Pipeline for image classification at The Norwegian National Museum and zooming display mechanism.
☆14 · Nov 3, 2017 · Updated 8 years ago
WANGXinyiLinda / LM_random_walk
Official code for the paper "Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation"
☆21 · Feb 29, 2024 · Updated 2 years ago
SaFo-Lab / MetaAgent
Official Repository of MetaAgent Program
☆53 · Dec 2, 2025 · Updated 7 months ago
cdhx / QueryAgent
Code and data for QueryAgent (ACL 2024)
☆21 · Dec 19, 2024 · Updated last year
MorvanZhou / go-unit-test-demo
Some Golang unit test demos
☆20 · Nov 21, 2019 · Updated 6 years ago
amayuelas / multi-agent-attack
MultiAgent Attack
☆15 · Oct 3, 2024 · Updated last year
JasonGross / guarantees-based-mechanistic-interpretability
☆18 · Updated this week
XinyuanLu00 / QACheck
Data and code for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"
☆19 · Dec 19, 2023 · Updated 2 years ago
mzsun01 / MM-LDM
☆11 · Apr 12, 2024 · Updated 2 years ago
aitsc / GLMKD
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method; GKD: A General Knowledge Distillation…
☆34 · Aug 4, 2023 · Updated 2 years ago
zhiyuanhubj / AAAI-19_slide_poster
☆21 · Jan 15, 2019 · Updated 7 years ago
safety-research / believe-it-or-not
Code and data for editing model beliefs with SDF and other methods, and for evaluating the depth of the implanted beliefs.
☆16 · Oct 23, 2025 · Updated 9 months ago
Mihir3009 / LogicBench
LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, fi…
☆40 · May 2, 2024 · Updated 2 years ago
konstantinjdobler / focus
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
☆37 · Jun 7, 2025 · Updated last year
zhangzh1328 / scSimGCL
Research Paper: "Graph Contrastive Learning as a Versatile Foundation for Advanced scRNA-seq Data Analysis"
☆11 · Nov 20, 2024 · Updated last year
sdnr1 / EBIM-NLI
Enhanced BiLSTM Inference Model for Natural Language Inference
☆26 · May 23, 2018 · Updated 8 years ago
Stochastic13 / Voronoi-Tessellations
Python3 script to create Voronoi tessellations (mosaic pattern) on images
☆10 · May 25, 2019 · Updated 7 years ago
siyan-zhao / ICL_decision_boundary
Official code for the paper "Probing the Decision Boundaries of In-context Learning in Large Language Models". https://arxiv.org/abs/2406.11233…
☆20 · Jul 27, 2025 · Updated 11 months ago
okfn / data-catalog-spec
Data Catalog Specification (Schema and Protocol)
☆21 · May 25, 2018 · Updated 8 years ago
FreedomIntelligence / MedGen
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.
☆33 · Apr 18, 2026 · Updated 3 months ago
liuqi6777 / Awesome-LLM4Ranking
A curated list of awesome papers about utilizing large language models for ranking.
☆32 · Apr 12, 2026 · Updated 3 months ago