a benchmark suite for testing logical reasoning abilities of prompt-based models
☆31 · Nov 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for LogiEval
Users that are interested in LogiEval are comparing it to the libraries listed below.
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks ☆101 · Aug 11, 2023 · Updated 2 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts" ☆11 · Nov 18, 2022 · Updated 3 years ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges ☆30 · Apr 21, 2025 · Updated last year
- The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text". ☆48 · Mar 2, 2023 · Updated 3 years ago
- The source code for Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning. #1 on the ReClor Leaderbo… ☆18 · Dec 2, 2025 · Updated 5 months ago
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles. ☆47 · Jan 16, 2025 · Updated last year
- Interview-based evaluation of LLMs ☆28 · Updated this week
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" ☆26 · Jun 3, 2025 · Updated 11 months ago
- PaddlePaddle Course ☆12 · Mar 4, 2021 · Updated 5 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data ☆13 · Jul 21, 2024 · Updated last year
- ☆18 · Jul 25, 2025 · Updated 10 months ago
- Convert Abstract Meaning Representation (AMR) into first-order logic ☆17 · Aug 7, 2024 · Updated last year
- ☆11 · Nov 11, 2022 · Updated 3 years ago
- A Python script to delete all comment and submission data from a given Reddit account. ☆11 · Jan 5, 2021 · Updated 5 years ago
- ☆11 · Mar 25, 2022 · Updated 4 years ago
- Repo for paper "IDOL: Indicator-oriented Logic Pre-training for Logical Reasoning" accepted to the Findings of ACL 2023 ☆22 · Nov 7, 2023 · Updated 2 years ago
- ☆13 · Jun 4, 2023 · Updated 2 years ago
- Implementation of "Adversarial purification with Score-based generative models", ICML 2021 ☆30 · Oct 24, 2021 · Updated 4 years ago
- ARI (Abstract Reasoning Induction) is an innovative framework designed to enhance the temporal reasoning capabilities of Large Language M… ☆13 · Dec 29, 2024 · Updated last year
- Official implementation of LLM+MAP: Bimanual Robot Task Planning using Large Language Models (LLMs) and Planning Domain Definition Langua… ☆22 · Mar 24, 2025 · Updated last year
- ☆12 · Jun 30, 2024 · Updated last year
- We enable LLM with personalization capability ☆11 · Nov 16, 2023 · Updated 2 years ago
- Grounding Language Models for Compositional and Spatial Reasoning ☆18 · Oct 26, 2022 · Updated 3 years ago
- The implementation of "Neural Networks for Open Domain Targeted Sentiment" based on package https://github.com/SUTDNLP/LibN3L ☆18 · Dec 7, 2015 · Updated 10 years ago
- ☆23 · Oct 14, 2024 · Updated last year
- ☆16 · Apr 11, 2026 · Updated last month
- ELIXIR: Learning from User Feedback on Explanations to Improve Recommender Models ☆10 · Feb 15, 2021 · Updated 5 years ago
- ☆10 · Oct 18, 2023 · Updated 2 years ago
- Logical fallacy online detection tools ☆20 · Oct 31, 2022 · Updated 3 years ago
- Official homepage for Tab-CoT: Zero-shot Tabular Chain of Thought (Findings of ACL 2023) ☆33 · May 31, 2023 · Updated 2 years ago
- We have released the code and demo program required for LLM with self-verification ☆62 · Oct 18, 2023 · Updated 2 years ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies" ☆27 · Mar 9, 2026 · Updated 2 months ago
- implementation of Session-Based Social Recommendation via Dynamic Graph Attention Networks ☆10 · Apr 17, 2020 · Updated 6 years ago
- CPRec: Learning Consumer and Producer Embeddings for User-Generated Content Recommendation ☆13 · Apr 16, 2019 · Updated 7 years ago
- Tests the GSM8K score of https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b ☆15 · Aug 10, 2023 · Updated 2 years ago
- The official repository of the Omni-MATH benchmark. ☆93 · Dec 22, 2024 · Updated last year
- [CVPR2025] Implementation of "FSHNet: Fully Sparse Hybrid Network for 3D Object Detection" ☆44 · Dec 28, 2025 · Updated 4 months ago
- This is an implementation of the POI recommendation model-PPR. ☆10 · Apr 19, 2023 · Updated 3 years ago
- ☆12 · Apr 3, 2026 · Updated last month