Strong-AI-Lab / Logical-and-abstract-reasoning
Evaluation on Logical Reasoning and Abstract Reasoning Challenges
☆20Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Logical-and-abstract-reasoning
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- AbstainQA, ACL 2024☆19Updated last month
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆24Updated last year
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated last year
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆35Updated last year
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆20Updated 2 years ago
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆11Updated 2 years ago
- Supporting code for ReCEval paper☆26Updated 2 months ago
- Evaluate the Quality of Critique☆35Updated 5 months ago
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆50Updated 10 months ago
- ☆44Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 7 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆30Updated 3 months ago
- A unified benchmark for math reasoning☆87Updated last year
- ☆25Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆49Updated 8 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- Code for COLING 2022 long paper: Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-…☆21Updated last year
- This is the code for the Submission 3358 at NeurIPS 2022.☆21Updated last year
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments☆33Updated last month
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆52Updated 2 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆55Updated last year
- ☆31Updated 7 months ago
- Adding new tasks to T0 without catastrophic forgetting☆30Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆46Updated 11 months ago
- ☆44Updated 2 months ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Updated 3 years ago
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models☆11Updated 2 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year