facebookresearch / sideLinks
The AI Knowledge Editor
β184Updated 3 years ago
Alternatives and similar repositories for side
Users that are interested in side are comparing it to the libraries listed below
Sorting:
- Pretraining Efficiently on S2ORC!β164Updated 8 months ago
- π« SpaCy wrapper for ConceptNet π«β94Updated last year
- β182Updated 2 years ago
- β94Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)β152Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 qβ¦β88Updated last year
- multimodal document analysisβ166Updated last year
- An instruction-based benchmark for text improvements.β141Updated 2 years ago
- β192Updated last year
- Find and fix bugs in natural language machine learning models using adaptive testing.β184Updated last year
- Pipeline for pulling and processing online language model pretraining data from the webβ178Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ184Updated last week
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demoβ277Updated last year
- The pipeline for the OSCAR corpusβ171Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.β309Updated 2 years ago
- Stanford's Alexa Prize socialbotβ133Updated last year
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation networkβ290Updated 9 months ago
- β79Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and softwareβ68Updated 2 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engineβ242Updated 2 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)β61Updated 2 years ago
- β215Updated 2 weeks ago
- Database Reasoning Over Text project for ACL paperβ353Updated 3 years ago
- Question-answers, collected from Googleβ129Updated 3 years ago
- Documentation effort for the BookCorpus datasetβ34Updated 4 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)β69Updated 2 years ago
- Tools for managing datasets for governance and training.β85Updated last month
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataβ¦β90Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- Apps built using Inspired Cognition's Critique.β58Updated 2 years ago