[ACL 2025] RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
☆26Jul 2, 2025Updated 10 months ago
Alternatives and similar repositories for RuleArena
Users that are interested in RuleArena are comparing it to the libraries listed below.
- INFINEL: An efficient GPU-based processing method for unpredictable large output graph queries [PPoPP'24]☆10Jan 15, 2024Updated 2 years ago
- Space-efficient graph data converter☆13Nov 3, 2022Updated 3 years ago
- ☆27Oct 23, 2025Updated 7 months ago
- Variable Neighborhood Search (VNS) algorithm for solving the TSP (with detailed C++ code and comments)☆10May 12, 2019Updated 7 years ago
- 📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)☆65Sep 9, 2025Updated 8 months ago
- Unsupervised Anomaly Detection System for Univariate Time Series☆21Sep 25, 2024Updated last year
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- ☆10Apr 7, 2024Updated 2 years ago
- ☆20Feb 25, 2026Updated 2 months ago
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]☆26Nov 3, 2024Updated last year
- ☆16Mar 11, 2024Updated 2 years ago
- This is the repository for the paper 'DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models' (EMNLP2024 …☆18Apr 5, 2025Updated last year
- Source code of the AAAI-2020 paper "Topic Modeling on Document Networks with Adjacent-Encoder"☆10Jul 14, 2020Updated 5 years ago
- We developed a dynamic Bus scheduling and Allocation system in collaboration with public transit service BEST operating in Mumbai, India.…☆18Aug 4, 2018Updated 7 years ago
- Testing of Neural Topic Modeling for Japanese articles☆13Jul 24, 2019Updated 6 years ago
- 🕸️ A graph-augmented dense statute retriever. (EACL 2023)☆25Sep 26, 2023Updated 2 years ago
- Variable Neighborhood Search Function for TSP problems☆25Dec 22, 2022Updated 3 years ago
- [SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision☆20Mar 28, 2024Updated 2 years ago
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast