OAfzal / nlp-for-peer-reviewLinks
☆44Updated 7 months ago
Alternatives and similar repositories for nlp-for-peer-review
Users that are interested in nlp-for-peer-review are comparing it to the libraries listed below
Sorting:
- ☆108Updated last year
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆37Updated last year
- Code/data for MARG (multi-agent review generation)☆44Updated 8 months ago
- ☆51Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆24Updated 4 months ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆33Updated 2 years ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆37Updated 10 months ago
- ☆43Updated 11 months ago
- The Prism Alignment Project☆79Updated last year
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆81Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- This repository contains data, code and models for contextual noncompliance.☆23Updated 11 months ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆57Updated 11 months ago
- AbstainQA, ACL 2024☆27Updated 9 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated 8 months ago
- ☆43Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- Resources for cultural NLP research☆98Updated 2 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆108Updated 2 years ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆60Updated 7 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆121Updated 7 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆78Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 7 months ago
- ☆52Updated last year
- ☆163Updated 7 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆69Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago