niderhoff / nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
☆5,886Updated 2 years ago
Alternatives and similar repositories for nlp-datasets:
Users that are interested in nlp-datasets are comparing it to the libraries listed below
- An open-source NLP research library, built on PyTorch.☆11,844Updated 2 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,846Updated 9 months ago
- A natural language modeling framework based on PyTorch☆6,325Updated 2 years ago
- ✨Fast Coreference Resolution in spaCy with Neural Networks☆2,876Updated 2 years ago
- Natural Language Processing Tasks and References☆3,024Updated 6 years ago
- InferSent sentence embeddings☆2,284Updated 3 years ago
- Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)☆2,954Updated 5 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP)☆17,129Updated last year
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,349Updated this week
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,187Updated last year
- A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural …☆2,939Updated 2 years ago
- Models, data loaders and abstractions for language processing, powered by PyTorch☆3,539Updated this week
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,388Updated 3 years ago
- all kinds of text classification models and more with deep learning☆7,910Updated last year
- Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained☆4,545Updated 3 years ago
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,219Updated last year
- Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings☆7,005Updated 5 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,106Updated last year
- Super easy library for BERT based NLP models☆1,895Updated 8 months ago
- Must-read Papers on pre-trained language models.☆3,358Updated 2 years ago
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages☆7,456Updated this week
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,619Updated 2 years ago
- A curated list of resources dedicated to text summarization☆1,543Updated 2 years ago
- Natural Language Processing Best Practices & Examples☆6,407Updated 2 years ago
- NLP made easy☆2,560Updated last year
- A fast, efficient universal vector embedding utility package.☆1,647Updated last year
- Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0☆1,790Updated 4 years ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Updated 9 months ago
- NLP, before and after spaCy☆2,225Updated last year
- A curated list of pretrained sentence and word embedding models☆2,256Updated 4 years ago