CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training
☆32Jul 20, 2022Updated 3 years ago
Alternatives and similar repositories for CCQA
Users that are interested in CCQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of CIKM2021 Paper 'Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need'☆16Aug 30, 2021Updated 4 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- This is the official code for the paper 'Systematically Exploring Redundancy Reduction inSummarizing Long Documents'.☆16Apr 30, 2021Updated 5 years ago
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 4 years ago
- Scalable training for dense retrieval models.☆299May 18, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CIKM 2021 Full Paper: FedMatch: Federated Learning Over Heterogeneous Question Answering Data☆12Dec 14, 2021Updated 4 years ago
- Data mapping framework for rust stuff☆54Mar 25, 2026Updated 2 months ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆44Nov 28, 2022Updated 3 years ago
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Jan 24, 2023Updated 3 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Jul 7, 2019Updated 6 years ago
- ☆15Aug 15, 2012Updated 13 years ago
- Synthetic Data Generation for Evaluation☆15Feb 21, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Anserini notebooks☆69Apr 2, 2023Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆74Mar 2, 2024Updated 2 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Generalised UDRL☆37May 12, 2022Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47May 29, 2023Updated 3 years ago
- ☆18Jun 10, 2022Updated 4 years ago
- Meta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019)☆12May 25, 2019Updated 7 years ago
- 업무자동화를 위한 Python 강의를 듣고 정리한 자료☆13Oct 10, 2017Updated 8 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆23Aug 29, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A library for Partially Homomorphic Encryption in Python☆12May 30, 2017Updated 9 years ago
- Software for building the IR Anthology.☆11Sep 19, 2023Updated 2 years ago
- Rank-Biased Precision, Overlap, Recall, and Alignment☆12Feb 18, 2025Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Sep 26, 2022Updated 3 years ago
- ☆91May 21, 2022Updated 4 years ago
- ☆41May 26, 2026Updated 2 weeks ago
- A library for creating complex experimental pipelines☆12Jul 25, 2022Updated 3 years ago
- Metadata browser of TREC☆10May 19, 2026Updated 3 weeks ago
- An Open-Source Package for Information Retrieval.☆442Oct 7, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆46Apr 13, 2022Updated 4 years ago
- pyndri is a Python interface to the Indri search engine.☆89Jun 21, 2022Updated 3 years ago
- Overview of IR/NLP papers covered in my team's reading group.☆10May 5, 2020Updated 6 years ago
- ☆14May 31, 2022Updated 4 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 5 years ago
- This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".☆24May 6, 2022Updated 4 years ago
- Generate visual podcasts about novels using open source models☆32Feb 15, 2023Updated 3 years ago