Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
☆22Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for cobbler
Users that are interested in cobbler are comparing it to the libraries listed below
Sorting:
- A flexible sentence segmentation library using CRF model and regex rules☆31Feb 22, 2026Updated 2 weeks ago
- A simple tutorial script on Streamlit using the Iris Dataset☆12Sep 13, 2023Updated 2 years ago
- ☆18Sep 3, 2024Updated last year
- Repository for DISRPT2021 shared task☆16Sep 5, 2022Updated 3 years ago
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆80Nov 15, 2023Updated 2 years ago
- ☆22Feb 26, 2024Updated 2 years ago
- ☆22Nov 23, 2023Updated 2 years ago
- ☆22Dec 1, 2022Updated 3 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆59May 31, 2024Updated last year
- ☆25Nov 24, 2023Updated 2 years ago
- DSBA code study☆30Nov 7, 2023Updated 2 years ago
- ☆29Dec 1, 2022Updated 3 years ago
- ☆39Jun 7, 2023Updated 2 years ago
- ☆10Nov 1, 2022Updated 3 years ago
- ☆10Nov 8, 2022Updated 3 years ago
- Test code of Inverse cloze task for information retrieval☆33Jan 10, 2021Updated 5 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- ☆11Oct 30, 2024Updated last year
- using rulsif for abrupt-change detection focusing on Environment, Usage, References, Introduction, Rulsif abrupt change detection.☆10Sep 3, 2025Updated 6 months ago
- Ansible for building kaggle environment☆13Jul 30, 2019Updated 6 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated last year
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models☆13Mar 6, 2025Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- ☆11Sep 8, 2023Updated 2 years ago
- Chrome Extension. As the name suggests.☆10Jan 30, 2022Updated 4 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- ☆11Nov 10, 2015Updated 10 years ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆12Mar 5, 2025Updated last year
- Unsupervised-Data-Augmentation-PyTorch☆12Dec 8, 2022Updated 3 years ago
- inductive reasoning benchmark with subregular hierarchy for string-to-string transformation☆16Jun 27, 2025Updated 8 months ago
- Implementation of Variational Hierarchical User-based Conversation Model☆10Jul 2, 2021Updated 4 years ago
- ☆11Jun 5, 2024Updated last year
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Fair paper matching☆11Jan 20, 2020Updated 6 years ago
- UI for ActivityWatch. Include category editor and viewer for multiple categorizations.☆10Jan 31, 2024Updated 2 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Tree-of-Debate converts scientific papers into LLM personas that debate their respective novelties. To emphasize structured, critical rea…☆18Jul 22, 2025Updated 7 months ago
- ☆11Oct 3, 2021Updated 4 years ago