YangLinyi / GLUE-X
We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that the OOD accuracy in NLP tasks needs to be paid more attention to since the significant performance decay compared to ID accuracy has been found in all settings.
☆94Updated last year
Alternatives and similar repositories for GLUE-X:
Users that are interested in GLUE-X are comparing it to the libraries listed below
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 6 months ago
- [COLING'22] Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER"☆44Updated 7 months ago
- [EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models☆66Updated 5 months ago
- ☆102Updated last year
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Updated last year
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆166Updated 4 months ago
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Updated last year
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆40Updated last year
- ☆47Updated 6 months ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆40Updated 4 months ago
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆39Updated 4 months ago
- A Unified Intermediate Representation for Graph Query Languages☆65Updated 2 years ago
- ☆46Updated 9 months ago
- notes for Multi-hop Reading Comprehension and open-domain question answering☆86Updated 3 years ago
- Official Implementation of "Pay Attention to What You Need"☆42Updated 2 months ago
- Author: Xiangyu Dong (xdong2ps@gmail.com) and Wenhao Yu (wyu1@nd.edu). EMMLP 2021. News text generation.☆18Updated 3 years ago
- LLM Benchmark for Code☆31Updated 8 months ago
- Author: Wenhao Yu (wyu1@nd.edu). ACL 2022. Commonsense Reasoning on Knowledge Graph for Text Generation☆55Updated 2 years ago
- Grimoire is All You Need for Enhancing Large Language Models☆113Updated last year
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆24Updated last year
- ☆120Updated last month
- Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆16Updated 2 months ago
- ☆51Updated this week
- Counterfactual-inference-based Text-classification Debiasing Framework.☆83Updated 3 years ago
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆215Updated 9 months ago
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆174Updated last week
- [ACL 24 main] Large Language Models Can Learn Temporal Reasoning☆51Updated 4 months ago
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆28Updated 9 months ago
- Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆34Updated 2 months ago
- ACL 2024☆31Updated 7 months ago