CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training
☆32Jul 20, 2022Updated 3 years ago
Alternatives and similar repositories for CCQA
Users that are interested in CCQA are comparing it to the libraries listed below.
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- This is the official code for the paper 'Systematically Exploring Redundancy Reduction in Summarizing Long Documents'.☆16Apr 30, 2021Updated 5 years ago
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 3 years ago
- Scalable training for dense retrieval models.☆298Apr 8, 2026Updated 3 weeks ago
- CIKM 2021 Full Paper: FedMatch: Federated Learning Over Heterogeneous Question Answering Data☆12Dec 14, 2021Updated 4 years ago
- Data mapping framework for rust stuff☆51Mar 25, 2026Updated last month
- ☆25Jun 25, 2021Updated 4 years ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆44Nov 28, 2022Updated 3 years ago
- GPT as Knowledge Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Jan 24, 2023Updated 3 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Jul 7, 2019Updated 6 years ago
- ☆15Aug 15, 2012Updated 13 years ago
- ☆13Dec 11, 2021Updated 4 years ago
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.☆127Feb 15, 2022Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47May 29, 2023Updated 2 years ago
- ☆18Jun 10, 2022Updated 3 years ago
- An implementation of GrASP (Shnarch et al., 2017)☆23Aug 29, 2022Updated 3 years ago
- ☆44Mar 29, 2023Updated 3 years ago
- A Python Interface to Reproducibility Measures of System-Oriented IR Experiments☆11Dec 2, 2025Updated 5 months ago
- Rank-Biased Precision, Overlap, Recall, and Alignment☆12Feb 18, 2025Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Sep 26, 2022Updated 3 years ago
- ☆91May 21, 2022Updated 3 years ago
- ☆38Mar 26, 2026Updated last month
- A library for creating complex experimental pipelines☆12Jul 25, 2022Updated 3 years ago
- Metadata browser of TREC☆10Updated this week
- This is the official repo for Gradient Agreement Filtering (GAF).☆25Jan 27, 2025Updated last year
- An Open-Source Package for Information Retrieval.☆443Oct 7, 2022Updated 3 years ago
- ☆46Apr 13, 2022Updated 4 years ago
- pyndri is a Python interface to the Indri search engine.☆89Jun 21, 2022Updated 3 years ago
- Overview of IR/NLP papers covered in my team's reading group.☆10May 5, 2020Updated 5 years ago
- TIFMO: Textual Inference Forward-chaining MOdule☆12Apr 25, 2014Updated 12 years ago
- Evaluate Transformers from the Hub 🔥☆14Apr 3, 2026Updated last month
- Training project about Deep Learning☆12Jun 22, 2017Updated 8 years ago
- ☆14May 31, 2022Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 4 years ago