☆47Mar 20, 2023Updated 3 years ago
Alternatives and similar repositories for benchmark_llm_summarization
Users that are interested in benchmark_llm_summarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Perform facts checks on your conversations with LLMs to catch fake-news, misleading information, and LLMs confusion.☆10Apr 22, 2023Updated 3 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Aug 11, 2022Updated 3 years ago
- ☆23Feb 26, 2024Updated 2 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Implementation of "Can we obtain significant success in RST discourse parsing by using Large Language Models?" (accepted by EACL 2024)☆20May 13, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Oct 9, 2023Updated 2 years ago
- ☆25Dec 13, 2024Updated last year
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆30Oct 23, 2025Updated 6 months ago
- ☆26Nov 21, 2022Updated 3 years ago
- A simple algorithm to identify and correct for label shift.☆21Feb 4, 2018Updated 8 years ago
- Python package for evaluating model calibration in classification☆20Nov 12, 2019Updated 6 years ago
- Code repo for the ICML 2021 paper "Making Paper Reviewing Robust to Bid Manipulation Attacks".☆10Sep 15, 2021Updated 4 years ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 months ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations☆11Apr 1, 2026Updated last month
- Label shift experiments☆17Dec 3, 2020Updated 5 years ago
- ☆16Mar 26, 2026Updated last month
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- Regularized Learning under label shifts☆18May 1, 2019Updated 7 years ago
- ☆12Jun 7, 2025Updated 11 months ago
- Preprocessing scripts for ACE and ERE datasets☆15Jul 28, 2020Updated 5 years ago
- GPTSolver: question solving with selecting/screenshot☆15Mar 25, 2023Updated 3 years ago
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …☆16Dec 14, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repo for "On Learning to Summarize with Large Language Models as References"☆43May 24, 2023Updated 2 years ago
- ☆26Jul 5, 2022Updated 3 years ago
- A new benchmark of 118 ICPC problems for evaluating LLM reasoning in competitive coding, featuring realistic ICPC competition scenario, r…☆17May 18, 2025Updated 11 months ago
- ☆26Nov 7, 2022Updated 3 years ago
- Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"☆16Jul 17, 2024Updated last year
- A PyTorch implementation of DeepFD (Deep Structure Learning for Fraud Detection)☆32Oct 2, 2020Updated 5 years ago
- The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"☆10Jun 23, 2024Updated last year
- Proof system for Fact Verification☆14Jun 7, 2022Updated 3 years ago
- ☆11Sep 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21Mar 25, 2023Updated 3 years ago
- ☆16Mar 27, 2023Updated 3 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆54Oct 23, 2025Updated 6 months ago
- Code implementation of our ICLR'21 paper "Calibration of Neural Networks using Splines"☆21Mar 31, 2023Updated 3 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs☆16Jan 6, 2025Updated last year