A community-built high-quality repository of NLP corpora
☆64Jan 8, 2022Updated 4 years ago
Alternatives and similar repositories for nlp-corpora
Users that are interested in nlp-corpora are comparing it to the libraries listed below
Sorting:
- Question-Answer Meaning Representation☆48Feb 17, 2022Updated 4 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- Simple setup for personal dotfiles☆11Nov 29, 2025Updated 3 months ago
- Notes from CSE 446, Winter 2016☆28Mar 16, 2016Updated 9 years ago
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Oct 16, 2018Updated 7 years ago
- DSTC6 Dialog System Technology Challenges, Track1, End-to-End Goal Oriented Dialog Learning☆17Dec 16, 2017Updated 8 years ago
- Unofficial implementation of Adaptive Input in PyTorch☆12Feb 22, 2019Updated 7 years ago
- This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…☆12Oct 1, 2020Updated 5 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- ☆32Jun 14, 2019Updated 6 years ago
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- “Open terminals”, “load CSVs”, “start hacking”☆16May 2, 2017Updated 8 years ago
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020☆17Mar 15, 2021Updated 4 years ago
- GluonNLP tutorial for Pycon2019☆14Aug 16, 2019Updated 6 years ago
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)☆200Jul 6, 2023Updated 2 years ago
- Semantic search using Transformers and others☆110Aug 27, 2020Updated 5 years ago
- Awesome Chinese Corpus Datasets and Models.☆18Oct 28, 2019Updated 6 years ago
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition".☆18Jan 2, 2021Updated 5 years ago
- Data splits for the NAACL 2016 paper☆22Mar 17, 2016Updated 9 years ago
- TensorFlow code and pre-trained models for BERT☆17Feb 28, 2019Updated 7 years ago
- 语雀 Yuque python SDK & Command line interface☆17Sep 11, 2019Updated 6 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆44Jun 11, 2021Updated 4 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Jun 3, 2021Updated 4 years ago
- NLG and NLU for dialogue processing☆41Jun 17, 2023Updated 2 years ago
- An inventory of data sets around Question Generation and Question Answering☆21Apr 2, 2019Updated 6 years ago
- Transformer model for the Amazon Topical-Chat Corpus. Baselines for DSTC9 Track 3.☆19Jul 9, 2020Updated 5 years ago
- Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.☆45Sep 1, 2020Updated 5 years ago
- Translating neuralese☆46Apr 26, 2017Updated 8 years ago
- Word sense disambiguation using contextualized word embedding☆17Dec 18, 2019Updated 6 years ago
- Central repository for QA-SRL data.☆21Feb 13, 2021Updated 5 years ago
- Dialog State Tracking Challenge 6 (DSTC6)☆54Jan 19, 2018Updated 8 years ago
- Text Content Manipulation☆45Nov 16, 2020Updated 5 years ago
- Sentence encoder and training code for Mean-Max AAE☆16Nov 8, 2018Updated 7 years ago
- reference pytorch code for named entity tagging☆87Oct 18, 2024Updated last year
- scalable knowledge graph construction from unstructured text☆95Mar 4, 2020Updated 6 years ago
- CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering☆23Feb 26, 2021Updated 5 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago