facebookresearch / SphereLinks
Web-scale retrieval for knowledge-intensive NLP
☆555Updated 2 years ago
Alternatives and similar repositories for Sphere
Users that are interested in Sphere are comparing it to the libraries listed below
Sorting:
- Database Reasoning Over Text project for ACL paper☆353Updated 3 years ago
- Multi-angle c(q)uestion answering☆458Updated 2 years ago
- The AI Knowledge Editor☆184Updated 3 years ago
- ☆363Updated 8 months ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆309Updated 2 years ago
- ☆514Updated last year
- Ask Me Anything language model prompting☆547Updated 2 years ago
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.☆312Updated 7 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆734Updated 6 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆336Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆315Updated 2 months ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆573Updated last year
- Prompt programming with FMs.☆443Updated 11 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated 2 years ago
- Tools and Modeling Code for the MASSIVE dataset☆547Updated 2 years ago
- Find and fix bugs in natural language machine learning models using adaptive testing.☆184Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆869Updated last year
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆464Updated 2 weeks ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆463Updated 2 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆583Updated 2 months ago
- The pipeline for the OSCAR corpus☆171Updated last year
- Search Engines with Autoregressive Language models☆288Updated 2 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.☆563Updated last year
- Library for Knowledge Intensive Language Tasks☆949Updated 3 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated last year
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆452Updated 10 months ago
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo☆277Updated last year
- Stanford's Alexa Prize socialbot☆133Updated last year