niderhoff / nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
☆5,768 · Updated last year
Related projects
Alternatives and complementary repositories for nlp-datasets
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo… ☆22,697 · Updated 3 months ago
- A Code-First Introduction to NLP course ☆3,425 · Updated last year
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,181 · Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) ☆16,716 · Updated 11 months ago
- Course materials for Georgia Tech CS 4650 and 7650, "Natural Language" ☆4,945 · Updated last year
- Natural Language Processing Tasks and References ☆3,015 · Updated 6 years ago
- NLP made easy ☆2,557 · Updated last year
- An open-source NLP research library, built on PyTorch. ☆11,756 · Updated last year
- A library for Multilingual Unsupervised or Supervised word Embeddings ☆3,189 · Updated 2 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks ☆2,856 · Updated last year
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆13,928 · Updated 2 weeks ago
- Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings ☆6,874 · Updated last year
- A natural language modeling framework based on PyTorch ☆6,338 · Updated 2 years ago
- A curated list of awesome Deep Learning (DL) for Natural Language Processing (NLP) resources ☆1,282 · Updated last year
- all kinds of text classification models and more with deep learning ☆7,859 · Updated last year
- InferSent sentence embeddings ☆2,280 · Updated 3 years ago
- Natural Language Processing Best Practices & Examples ☆6,376 · Updated 2 years ago
- Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ) ☆2,956 · Updated 5 years ago
- Topic Modelling for Humans ☆15,657 · Updated 2 months ago
- 100 Must-Read NLP Papers ☆3,748 · Updated 3 years ago
- Language-Agnostic SEntence Representations ☆3,596 · Updated 6 months ago
- A curated list of pretrained sentence and word embedding models ☆2,224 · Updated 3 years ago
- This repository recorded my NLP journey. ☆1,073 · Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,889 · Updated last year
- Basic Utilities for PyTorch Natural Language Processing (NLP) ☆2,213 · Updated last year
- TensorFlow Neural Machine Translation Tutorial ☆6,386 · Updated 2 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP ☆2,344 · Updated 9 months ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages ☆7,284 · Updated last week
- Papers & presentation materials from Hugging Face's internal science day ☆2,035 · Updated 4 years ago
- Learning embeddings for classification, retrieval and ranking. ☆3,943 · Updated last year