ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"
☆21Mar 22, 2024Updated last year
Alternatives and similar repositories for lfqa_eval
Users that are interested in lfqa_eval are comparing it to the libraries listed below
Sorting:
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- A python package of common operations for AMRs☆29Jun 7, 2022Updated 3 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆10Dec 6, 2022Updated 3 years ago
- ☆16Nov 17, 2025Updated 3 months ago
- Dataset and code to reproduce the results of the paper "Evolving Structures in Complex Systems"☆11Dec 16, 2019Updated 6 years ago
- Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"☆13Feb 14, 2022Updated 4 years ago
- ☆11Nov 10, 2015Updated 10 years ago
- All the tools that allow me to never ever open up Final Cut☆11Feb 16, 2025Updated last year
- This Node.js script automates the process of downloading and extracting source maps from websites. It uses Puppeteer to navigate web page…☆18Dec 17, 2025Updated 2 months ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago
- Moral Machine Experiment on LLMs☆11Feb 2, 2026Updated last month
- Base definition of rk-boot plugins. rk-boot is a library to start goLang microservice from YAML☆11Jul 15, 2024Updated last year
- Discourse Probing of Pretrained Language Models. In Proceedings of NAACL 2021.☆10Jun 27, 2022Updated 3 years ago
- ☆13May 21, 2023Updated 2 years ago
- Official GraphQLBlog repository. Add your blog posts as pull request!☆13Jan 11, 2023Updated 3 years ago
- Python class to explore the ImageNet database☆16Jan 12, 2012Updated 14 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Jan 9, 2026Updated last month
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 7 years ago
- ☆11Sep 8, 2024Updated last year
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- A simple agent powered by LLMs that performs tasks.☆13Apr 25, 2025Updated 10 months ago
- Fair paper matching☆11Jan 20, 2020Updated 6 years ago
- Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"