3B-Group / ConvReLinks
π€ConvReπ€―: An Investigation of LLMsβ Inefficacy in Understanding Converse Relations (EMNLP 2023)
β23Updated last year
Alternatives and similar repositories for ConvRe
Users that are interested in ConvRe are comparing it to the libraries listed below
Sorting:
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".β65Updated 2 years ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinismβ30Updated 10 months ago
- β14Updated last year
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasksβ28Updated 8 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorialsβ32Updated 3 months ago
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Dataβ17Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planningβ36Updated last year
- A zero-shot neural semantic parser without using annotated parallel training data.β8Updated 3 years ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.β62Updated 10 months ago
- β13Updated 6 months ago
- β41Updated last year
- β97Updated last year
- [ACL 2024] The project of Symbol-LLMβ54Updated 10 months ago
- β28Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"β109Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingβ50Updated 5 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoningβ25Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models".β40Updated 2 years ago
- A Portal Site for Structured Knowledge Grounding(SKG) Resources.β9Updated 2 years ago
- Supporting code for ReCEval paperβ28Updated 8 months ago
- Code for "RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"β20Updated 2 months ago
- Analyzing LLM Alignment via Token distribution shiftβ16Updated last year
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasksβ22Updated 2 years ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languagesβ23Updated 2 years ago
- [ICML 2024] Self-Infilling Code Generationβ19Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspectiveβ31Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"β54Updated last year
- β36Updated last year
- The source code for running LLMs on the AAAR-1.0 benchmark.β16Updated 2 months ago
- β27Updated 2 years ago