facebookresearch / Sphere
Web-scale retrieval for knowledge-intensive NLP
☆552Updated 2 years ago
Alternatives and similar repositories for Sphere:
Users who are interested in Sphere are comparing it to the libraries listed below.
- The AI Knowledge Editor☆182Updated 2 years ago
- Multi-angle c(q)uestion answering☆458Updated 2 years ago
- Database Reasoning Over Text project for ACL paper☆354Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆310Updated last year
- Library for Knowledge Intensive Language Tasks☆938Updated 3 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆165Updated 2 weeks ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Updated 2 years ago
- ☆362Updated 5 months ago
- Search Engines with Autoregressive Language models☆284Updated 2 years ago
- ☆501Updated last year
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆781Updated 10 months ago
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆442Updated 7 months ago
- Ask Me Anything language model prompting☆547Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆333Updated last year
- Question-answers, collected from Google☆129Updated 3 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them"☆202Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆302Updated 4 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆391Updated 9 months ago
- Implementation of RETRO, DeepMind's Retrieval based Attention net, in PyTorch☆862Updated last year
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆571Updated last year
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆464Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated last year
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.☆310Updated 4 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆733Updated 3 months ago
- The pipeline for the OSCAR corpus☆168Updated last year
- SummVis is an interactive visualization tool for text summarization.☆252Updated 2 years ago
- Autoregressive Entity Retrieval☆786Updated last year