☆22Jan 5, 2024Updated 2 years ago
Alternatives and similar repositories for scale-score
Users that are interested in scale-score are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Jan 14, 2023Updated 3 years ago
- ☆11Nov 27, 2022Updated 3 years ago
- ☆27Nov 6, 2022Updated 3 years ago
- 9th solution☆11Oct 11, 2022Updated 3 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…☆14Jan 25, 2024Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆61Jan 27, 2025Updated last year
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆27Nov 13, 2023Updated 2 years ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- ☆40Jun 7, 2023Updated 2 years ago
- ☆35Nov 17, 2021Updated 4 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- My solution for the ''LLM - Detect AI Generated Text'' kaggle competition☆16Feb 2, 2024Updated 2 years ago
- ☆10May 1, 2025Updated 10 months ago
- [APSIPA ASC 2023] The official code of paper, "FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Au…☆17Mar 7, 2024Updated 2 years ago
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph☆11Dec 17, 2023Updated 2 years ago
- Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.☆27Mar 14, 2025Updated last year
- Transformer-based Long Document Classification☆17Nov 2, 2022Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- A Universal Platform for Training and Evaluation of Mobile Interaction☆61Sep 24, 2025Updated 6 months ago
- Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, …☆14Mar 15, 2022Updated 4 years ago
- Censored tweets annotated for specificity; AAAI 2019 paper: Predicting and Analyzing Language Specificity in Social Media Posts☆10Oct 19, 2021Updated 4 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization☆18Oct 21, 2024Updated last year
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 2 years ago
- 6th Position Solution Code for Kaggle - LLM Science Exam Competition☆24Jul 8, 2024Updated last year
- This is the official code for the paper 'Systematically Exploring Redundancy Reduction inSummarizing Long Documents'.☆16Apr 30, 2021Updated 4 years ago
- This is the repository for the paper 'DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models' (EMNLP2024 …☆18Apr 5, 2025Updated 11 months ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 6 years ago
- Code for the EMNLP'21 paper "Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding"☆16Mar 13, 2022Updated 4 years ago
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆29May 30, 2023Updated 2 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated last year
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- ☆21Aug 19, 2024Updated last year
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago