minnesotanlp / cobbler
Code and data for Koo et al.'s ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
☆21 · Feb 16, 2024 · Updated last year
Alternatives and similar repositories for cobbler
Users who are interested in cobbler compare it to the repositories listed below.
- A flexible sentence segmentation library using CRF model and regex rules ☆31 · Oct 5, 2025 · Updated 4 months ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking ☆13 · Feb 5, 2023 · Updated 3 years ago
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists ☆31 · Aug 14, 2025 · Updated 6 months ago
- ☆18 · Sep 3, 2024 · Updated last year
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022) ☆80 · Nov 15, 2023 · Updated 2 years ago
- ☆22 · Feb 26, 2024 · Updated last year
- ☆21 · Nov 30, 2022 · Updated 3 years ago
- ☆22 · Nov 23, 2023 · Updated 2 years ago
- ☆23 · Mar 19, 2024 · Updated last year
- ☆22 · Dec 1, 2022 · Updated 3 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions" ☆59 · May 31, 2024 · Updated last year
- Implementation for https://arxiv.org/abs/2005.00652 ☆28 · Dec 8, 2022 · Updated 3 years ago
- DSBA code study ☆30 · Nov 7, 2023 · Updated 2 years ago
- ☆29 · Dec 1, 2022 · Updated 3 years ago
- ☆30 · Dec 1, 2022 · Updated 3 years ago
- ☆39 · Jun 7, 2023 · Updated 2 years ago
- ☆10 · Nov 8, 2022 · Updated 3 years ago
- ☆10 · Nov 1, 2022 · Updated 3 years ago
- Test code of Inverse cloze task for information retrieval ☆33 · Jan 10, 2021 · Updated 5 years ago
- Ansible for building kaggle environment ☆13 · Jul 30, 2019 · Updated 6 years ago
- Using RuLSIF for abrupt-change detection ☆10 · Sep 3, 2025 · Updated 5 months ago
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models ☆13 · Mar 6, 2025 · Updated 11 months ago
- Implementation of Variational Hierarchical User-based Conversation Model ☆10 · Jul 2, 2021 · Updated 4 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm… ☆10 · May 9, 2024 · Updated last year
- MultiVariate Convolutional Neural Network ☆10 · May 10, 2018 · Updated 7 years ago
- ☆11 · Nov 10, 2015 · Updated 10 years ago
- sealos deck ☆11 · Mar 30, 2024 · Updated last year
- Chrome Extension. As the name suggests. ☆10 · Jan 30, 2022 · Updated 4 years ago
- Unsupervised-Data-Augmentation-PyTorch ☆12 · Dec 8, 2022 · Updated 3 years ago
- Inductive reasoning benchmark with subregular hierarchy for string-to-string transformation ☆14 · Jun 27, 2025 · Updated 7 months ago
- A3C tensorflow implementation ☆11 · Jul 22, 2018 · Updated 7 years ago
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios ☆14 · Nov 19, 2024 · Updated last year
- ☆10 · Feb 12, 2024 · Updated 2 years ago
- Fair paper matching ☆11 · Jan 20, 2020 · Updated 6 years ago
- ☆10 · Dec 18, 2023 · Updated 2 years ago
- ☆11 · Jun 5, 2024 · Updated last year
- ☆13 · Apr 12, 2024 · Updated last year
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022). ☆44 · Dec 25, 2022 · Updated 3 years ago
- Flappy bird clone made to learn how to work with Phaser ☆12 · Feb 10, 2018 · Updated 8 years ago