facebookresearch / sideLinks
The AI Knowledge Editor
☆182Updated 2 years ago
Alternatives and similar repositories for side
Users that are interested in side are comparing it to the libraries listed below
Sorting:
- Question-answers, collected from Google☆129Updated 3 years ago
- Pretraining Efficiently on S2ORC!☆164Updated 7 months ago
- ☆183Updated 2 years ago
- ☆95Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- ☆98Updated 2 years ago
- Web-scale retrieval for knowledge-intensive NLP☆554Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Updated 2 years ago
- Apps built using Inspired Cognition's Critique.☆58Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆88Updated last year
- ☆78Updated 2 years ago
- A diff tool for language models☆42Updated last year
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆72Updated 10 months ago
- ☆211Updated 3 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆113Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆93Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated last year
- multimodal document analysis☆164Updated 11 months ago
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆289Updated 8 months ago
- The pipeline for the OSCAR corpus☆167Updated last year
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆177Updated 3 years ago
- Factored Cognition Primer: How to write compositional language model programs☆49Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆101Updated last year
- ☆76Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago