maxent-ai / Datasets
Datasets with text data for use in NLP, text analysis, information extraction, and ML research.
☆16 · Updated 6 years ago
Alternatives and similar repositories for Datasets:
Users interested in Datasets are comparing it to the repositories listed below.
- ☆16 · Updated last year
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆40 · Updated 5 years ago
- Question generation from Reading Comprehension ☆18 · Updated 3 years ago
- Classify a job description (or noisy job title) into an ONET job title ☆19 · Updated 8 years ago
- An inventory of data sets around Question Generation and Question Answering ☆21 · Updated 6 years ago
- Text preprocessing tools in Python. ☆27 · Updated 7 years ago
- ☆32 · Updated 6 years ago
- Political Discourse Analysis Using Pre-Trained Word Vectors. ☆22 · Updated 2 years ago
- This repository provides our datasets for Arabic emotion detection on Twitter ☆9 · Updated 6 years ago
- Natural Language Generation for Gramex applications. ☆24 · Updated 2 years ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t… ☆34 · Updated 7 years ago
- BERT semantic search engine for searching research papers on coronavirus COVID-19 in Google Colab ☆31 · Updated 4 years ago
- The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply u… ☆50 · Updated 7 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆58 · Updated 11 months ago
- The Heracles framework for developing and evaluating text mining algorithms ☆10 · Updated 2 years ago
- Regex-like pattern tree matching, but on a sentence's tree instead of strings ☆42 · Updated 7 years ago
- Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team ☆28 · Updated 4 years ago
- Clinical spelling correction with word and character n-gram embeddings. ☆74 · Updated 2 years ago
- Using NLP to cluster reddit user comments by topics ☆13 · Updated 7 years ago
- Clinical NLP Analysis with Elasticsearch and Kibana ☆35 · Updated 6 years ago
- ☆64 · Updated 2 years ago
- A simple Flask API for named entity extraction using a spaCy model ☆47 · Updated 6 years ago
- A natural language processing tool for automatically detecting quotations in text. ☆15 · Updated 3 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis" ☆56 · Updated 4 months ago
- Text processing library for sentiment analysis and related tasks ☆27 · Updated 6 years ago
- Deep-learning system presented in "EmoSense at SemEval-2019 Task 3: Bidirectional LSTM Network for Contextual Emotion Detection in Textua… ☆27 · Updated 5 years ago
- spaCy-to-NAF converter ☆21 · Updated 9 months ago
- Language Model and Text Classification for German Language using Deep Learning ☆18 · Updated 6 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python. ☆84 · Updated 8 months ago
- ☆43 · Updated 9 years ago