facebookresearch / Sphere
Web-scale retrieval for knowledge-intensive NLP
☆556Updated 2 years ago
Alternatives and similar repositories for Sphere
Users who are interested in Sphere compare it with the libraries listed below.
- The AI Knowledge Editor☆185Updated 3 years ago
- Multi-angle c(q)uestion answering☆456Updated 3 years ago
- Database Reasoning Over Text project for ACL paper☆350Updated 3 years ago
- Ask Me Anything language model prompting☆546Updated 2 years ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆739Updated 3 weeks ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- ☆363Updated 10 months ago
- ☆522Updated last year
- The pipeline for the OSCAR corpus☆172Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Updated 2 years ago
- Tools and Modeling Code for the MASSIVE dataset☆550Updated 2 years ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆574Updated last year
- Zero and Few shot named entity & relationships recognition☆388Updated 3 weeks ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆321Updated 5 months ago
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆461Updated 2 years ago
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.☆312Updated 10 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Find and fix bugs in natural language machine learning models using adaptive testing.☆186Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated 2 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆441Updated 3 years ago
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆457Updated last year
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆472Updated this week
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆187Updated 3 years ago
- SummVis is an interactive visualization tool for text summarization.☆253Updated 3 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆243Updated 2 years ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆196Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models.☆586Updated 3 weeks ago
- Library for Knowledge Intensive Language Tasks☆956Updated 3 years ago
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo☆280Updated 2 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆314Updated 5 years ago