YangLinyi / GLUE-X
We use 14 datasets as OOD test data and evaluate 21 widely used models on 8 NLU tasks. Our findings confirm that OOD accuracy in NLP tasks deserves more attention, since a significant performance decay relative to ID accuracy appears in all settings.
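The comparison between ID and OOD accuracy described above can be sketched as follows. This is a minimal illustration, not code from the GLUE-X repository: the helper names `accuracy` and `performance_decay` are hypothetical, the relative-drop definition of decay is an assumption, and the labels and predictions are toy values made up for the example.

```python
from typing import Sequence


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Fraction of predictions that match the gold labels."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def performance_decay(id_acc: float, ood_acc: float) -> float:
    """Relative drop from ID to OOD accuracy (assumed definition).

    A positive value means the model does worse on OOD data.
    """
    if id_acc <= 0:
        raise ValueError("ID accuracy must be positive")
    return (id_acc - ood_acc) / id_acc


# Toy, made-up labels and predictions, for illustration only.
id_labels = [1, 0, 1, 1]
id_preds = [1, 0, 1, 0]
ood_labels = [1, 0, 1, 1]
ood_preds = [0, 0, 1, 0]

id_acc = accuracy(id_preds, id_labels)      # in-distribution test set
ood_acc = accuracy(ood_preds, ood_labels)   # out-of-distribution test set
decay = performance_decay(id_acc, ood_acc)

print(f"ID acc={id_acc:.2f}, OOD acc={ood_acc:.2f}, relative decay={decay:.2%}")
```

Reporting the drop relative to ID accuracy, rather than as an absolute difference, makes models with different ID baselines easier to compare; either convention could be used.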
☆93Updated 2 years ago
Alternatives and similar repositories for GLUE-X
Users that are interested in GLUE-X are comparing it to the libraries listed below
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆65Updated last year
- [EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models☆89Updated this week
- [ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Mode…☆39Updated 5 months ago
- ☆102Updated 2 years ago
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Updated 2 years ago
- [COLING'22] Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER"☆45Updated last year
- ☆52Updated last year
- A Unified Intermediate Representation for Graph Query Languages☆66Updated 3 years ago
- [NeurIPS 2025] Hybrid Latent Reasoning via Reinforcement Learning☆170Updated 4 months ago
- LLM Benchmark for Code☆32Updated last year
- An Extensible Framework for Retrieval-Augmented LLM Applications: Learning Relevance Beyond Simple Similarity.☆41Updated last year
- ☆111Updated last month
- ☆52Updated 4 months ago
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆178Updated 9 months ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆28Updated last year
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Updated last year
- Official Implementation of "Pay Attention to What You Need"☆44Updated 10 months ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆42Updated last year
- [ACL'23] Code for "SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recogni…☆39Updated 8 months ago
- ☆44Updated last year
- [NeurIPS 2025 Poster] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning☆116Updated last month
- Counterfactual-inference-based Text-classification Debiasing Framework.☆83Updated 4 years ago
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆20Updated 5 months ago
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆197Updated 7 months ago
- A Graph Query Language Transpiler☆34Updated last year
- Author: Xiangyu Dong (xdong2ps@gmail.com) and Wenhao Yu (wyu1@nd.edu). EMNLP 2021. News text generation.☆18Updated 4 years ago
- The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.☆56Updated 6 months ago
- Grimoire is All You Need for Enhancing Large Language Models☆117Updated last year
- Author: Wenhao Yu (wyu1@nd.edu). ACL 2022 Dict-BERT: Enhancing Language Model Pre-training with Dictionary☆42Updated 3 years ago
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆42Updated 2 months ago