☆29Feb 26, 2024Updated 2 years ago
Alternatives and similar repositories for rag-convincingness
Users that are interested in rag-convincingness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆66May 16, 2025Updated 11 months ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated last year
- ☆14Apr 21, 2023Updated 2 years ago
- Official implementation of BPA (CVPR 2022)☆13Jun 17, 2022Updated 3 years ago
- ☆25Nov 21, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- Official repository for ALT (ALignment with Textual feedback).☆10Jul 25, 2024Updated last year
- Extract links from Wikipedia pages to create a cross-document coreference dataset (multilingual support)☆11Apr 13, 2023Updated 3 years ago
- Code used to create the Linked WikiText-2 dataset☆16May 22, 2023Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated 2 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 10 months ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆85May 12, 2023Updated 2 years ago
- ☆49Oct 10, 2023Updated 2 years ago
- A unified framework for evaluating LLM factuality with modular, plug-and-play multi-source verification.☆20Nov 3, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- codebase for the Text-based NP Enrichment (TNE) paper☆19Mar 12, 2024Updated 2 years ago
- ☆10Jul 18, 2022Updated 3 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- Extended Wikilinks dataset description☆15Apr 1, 2018Updated 8 years ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- ✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks☆18Aug 16, 2024Updated last year
- ☆22Feb 13, 2026Updated 2 months ago
- data and code for paper "CCGIR: Information Retrieval-based Code Comment Generation Method for Smart Contracts", which accepted in KBS. 智…☆17Apr 24, 2022Updated 3 years ago
- Entity Linking within a Social Media Platform☆11May 2, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆34Jun 8, 2023Updated 2 years ago
- Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation☆32May 21, 2023Updated 2 years ago
- public repo for ESTER dataset and modeling (EMNLP'21)☆20Feb 2, 2022Updated 4 years ago
- Codebase of ACL2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Ques…☆16Jun 4, 2024Updated last year
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Apr 3, 2022Updated 4 years ago
- ☆40Aug 1, 2025Updated 8 months ago
- Finding Camouflaged Needle in a Haystack? Pornographic Products Detection via Berrypicking Tree Model☆10Jul 29, 2019Updated 6 years ago
- A Fast Medical Image Viewer☆10Dec 15, 2018Updated 7 years ago
- An assistant for betting on prediction markets on manifold.markets, utilizing OpenAI's GPT APIs.☆34Jan 9, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- EventEA: Benchmarking Entity Alignment for Event-centric Knowledge Graphs☆11May 8, 2022Updated 3 years ago
- ☆11Sep 27, 2022Updated 3 years ago
- CIKM 2021: Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking☆20Sep 28, 2022Updated 3 years ago
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- A neural parser for QA-SRL.☆23Apr 29, 2019Updated 6 years ago
- Exact Single-Source SimRank Computation on Large Graphs☆13Oct 1, 2020Updated 5 years ago
- SpuCo is a Python package developed to further research to address spurious correlations.☆25Jan 16, 2025Updated last year