Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
☆57Dec 25, 2024Updated last year
Alternatives and similar repositories for INFO-RAG
Users that are interested in INFO-RAG are comparing it to the libraries listed below.
- ☆10Nov 15, 2023Updated 2 years ago
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024)☆59Nov 15, 2025Updated 7 months ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 3 years ago
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆191Dec 5, 2025Updated 6 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆51Dec 7, 2024Updated last year
- [ACL-2024] Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training☆43Oct 28, 2024Updated last year
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated 2 years ago
- Contrastive Learning Reduces Hallucination in Conversations☆25Oct 17, 2023Updated 2 years ago
- Codebase of ACL2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Ques…☆16Jun 4, 2024Updated 2 years ago
- Implementation of "REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering"☆35Nov 21, 2024Updated last year
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"☆22Jul 13, 2025Updated 11 months ago
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated 2 years ago
- ☆13Feb 13, 2026Updated 4 months ago
- Code to compute topic coherence for several topic cardinalities and aggregate scores across them☆21Sep 10, 2025Updated 9 months ago
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆149Apr 26, 2026Updated last month
- End-to-End Neural Event Coreference Resolution☆11Jun 18, 2023Updated 3 years ago
- NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking☆13Sep 10, 2021Updated 4 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Released code for our ICLR23 paper.☆66Mar 23, 2023Updated 3 years ago
- ☆14Feb 2, 2023Updated 3 years ago
- ☆17Jul 18, 2022Updated 3 years ago
- From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking☆14Oct 25, 2022Updated 3 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- ☆52Nov 27, 2025Updated 6 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆310Oct 18, 2024Updated last year
- ☆10Aug 16, 2022Updated 3 years ago
- [EMNLP 2022] Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning☆14Jun 22, 2024Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from Google's paper `On the Theoretica…☆16Sep 4, 2025Updated 9 months ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆16Oct 2, 2025Updated 8 months ago
- Chinese ancient books in JSON format☆13Aug 13, 2020Updated 5 years ago
- Code for "Challenges of Using Text Classifiers for Causal Inference," at EMNLP '18☆23Sep 23, 2020Updated 5 years ago
- Code for NAACL 2022 paper "Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification"☆25Oct 13, 2022Updated 3 years ago
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation."☆21Jun 4, 2023Updated 3 years ago
- ☆14Oct 17, 2024Updated last year
- The code is for our AAAI2023 paper: Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs (Ding…☆10Dec 17, 2022Updated 3 years ago
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆34Feb 7, 2024Updated 2 years ago
- ☆10Jul 5, 2023Updated 2 years ago
- CapOS is an open source server operating system based on OpenWrt. It aims to provide an easy-to-use Linux server OS for everyone. CapOS u…☆40Updated this week