facebookresearch / Sphere
Web-scale retrieval for knowledge-intensive NLP
☆555 · Updated last year
Related projects
Alternatives and complementary repositories for Sphere
- The AI Knowledge Editor ☆182 · Updated 2 years ago
- Multi-angle c(q)uestion answering ☆458 · Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆304 · Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆287 · Updated last year
- A Python framework for performing information retrieval experiments, building on http://terrier.org/ ☆417 · Updated 2 weeks ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆398 · Updated 3 years ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them" ☆202 · Updated 3 years ago
- ☆487 · Updated 9 months ago
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv… ☆436 · Updated 2 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆323 · Updated last year
- Search Engines with Autoregressive Language models ☆277 · Updated last year
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use. ☆304 · Updated this week
- Database Reasoning Over Text project for ACL paper ☆353 · Updated 2 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆561 · Updated this week
- Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/ ☆239 · Updated 9 months ago
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project. ☆534 · Updated 5 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development. ☆728 · Updated this week
- Library for Knowledge Intensive Language Tasks ☆916 · Updated 2 years ago
- The pipeline for the OSCAR corpus ☆162 · Updated 11 months ago
- Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization) ☆457 · Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically … ☆166 · Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web ☆174 · Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ☆777 · Updated 6 months ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F… ☆563 · Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆920 · Updated 2 months ago
- Zero and Few shot named entity & relationships recognition ☆349 · Updated 2 months ago
- docTTTTTquery document expansion model ☆356 · Updated last year
- An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.) ☆443 · Updated 3 weeks ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆187 · Updated 3 years ago
- ☆179 · Updated last year