OAfzal / nlp-for-peer-review
☆37Updated 4 months ago
Alternatives and similar repositories for nlp-for-peer-review:
Users that are interested in nlp-for-peer-review are comparing it to the libraries listed below
- Code/data for MARG (multi-agent review generation)☆42Updated 5 months ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆36Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆60Updated last year
- ☆106Updated 11 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆39Updated 5 months ago
- This repository contains data, code and models for contextual noncompliance.☆21Updated 9 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Updated last year
- ☆73Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆33Updated last year
- Evaluate the Quality of Critique☆34Updated 10 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 4 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated 2 weeks ago
- AbstainQA, ACL 2024☆25Updated 6 months ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆52Updated 8 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆53Updated 5 months ago
- ☆41Updated 11 months ago
- ☆24Updated 3 months ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆22Updated last month
- Repository for the ACL 2024 conference website☆18Updated 2 months ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆76Updated 2 years ago
- Tasks for describing differences between text distributions.☆16Updated 8 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- ☆41Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Updated last year
- The Prism Alignment Project☆75Updated last year
- Augmenting Statistical Models with Natural Language Parameters☆26Updated 7 months ago
- ☆34Updated 3 years ago