delvinso / covid19_unique_tweets
An ongoing dataset of hashtags, n-gram counts, and other miscellaneous NLP data for COVID-19 analysis, derived from over 100,000,000 tweets collected since mid-January 2020.
☆57 · Updated 3 years ago
Alternatives and similar repositories for covid19_unique_tweets
Users who are interested in covid19_unique_tweets compare it to the repositories listed below.
- Getting recommendations from natural language ☆123 · Updated 4 years ago
- Topic Inference with Zeroshot models ☆61 · Updated last year
- Code for obtaining the Curation Corpus abstractive text summarisation dataset ☆126 · Updated 4 years ago
- Clean personally identifiable information from dirty dirty text using spaCy. ☆41 · Updated last year
- Cleans Reddit Text Data ☆83 · Updated 5 years ago
- Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖 ☆184 · Updated 2 years ago
- Minimal starting point for rapid prototyping interactive Human-AI tools ☆33 · Updated 3 years ago
- Dataframe Integration with spaCy. ☆103 · Updated 4 years ago
- On Generating Extended Summaries of Long Documents ☆78 · Updated 4 years ago
- Tools for helping out with COVID-19 research ☆28 · Updated 4 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆81 · Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆109 · Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆91 · Updated 3 years ago
- Code for the CUP Elements on text analysis in Python for social scientists ☆136 · Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ☆62 · Updated 8 months ago
- Interpretable data visualizations for understanding how texts differ at the word level ☆275 · Updated 3 months ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset" ☆54 · Updated 3 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 · Updated last month
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 · Updated 5 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 3 years ago
- State of the art open-source translation for Indic languages. ☆5 · Updated 4 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆104 · Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated last year
- open datasets for sentiment analysis based on tweets in English/Spanish/French/German/Italian ☆72 · Updated last year
- Information extraction from English and German texts based on predicate logic ☆135 · Updated last year
- Creating class-based TF-IDF matrices ☆83 · Updated 2 years ago
- Generate realistic Instagram captions using transformers 🤗 ☆102 · Updated last year
- a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models ☆195 · Updated 4 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do… ☆48 · Updated last year
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches. ☆218 · Updated 2 years ago