facebookresearch / side
The AI Knowledge Editor
☆185 Updated 3 years ago
Alternatives and similar repositories for side
Users that are interested in side are comparing it to the libraries listed below
- multimodal document analysis ☆166 Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ☆89 Updated last year
- An instruction-based benchmark for text improvements. ☆142 Updated 2 years ago
- Pretraining Efficiently on S2ORC! ☆169 Updated 10 months ago
- ☆193 Updated last year
- Question-answers, collected from Google ☆129 Updated 4 years ago
- Open source library for few shot NLP ☆79 Updated 2 years ago
- ☆184 Updated 2 years ago
- ☆82 Updated 2 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022) ☆114 Updated 2 years ago
- The Python library with command line tools to interact with Dynabench (https://dynabench.org/), such as uploading models. ☆55 Updated 3 years ago
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆186 Updated 2 months ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models. ☆179 Updated 3 years ago
- Documentation effort for the BookCorpus dataset ☆34 Updated 4 years ago
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo ☆280 Updated last year
- The pipeline for the OSCAR corpus ☆171 Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm ☆68 Updated last month
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021) ☆79 Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆71 Updated 2 years ago
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining. ☆31 Updated 2 years ago
- Evaluation suite for large-scale language models. ☆128 Updated 4 years ago
- Pipeline for pulling and processing online language model pretraining data from the web ☆177 Updated 2 years ago
- ☆96 Updated last year
- A diff tool for language models ☆44 Updated last year
- Web-scale retrieval for knowledge-intensive NLP ☆556 Updated 2 years ago
- Tools for managing datasets for governance and training. ☆83 Updated last month
- Semantically Structured Sentence Embeddings ☆67 Updated 11 months ago
- Code for SaGe subword tokenizer (EACL 2023) ☆26 Updated 9 months ago