delvinso / covid19_unique_tweets
An ongoing dataset of hashtags, n-gram counts, and other miscellaneous NLP data for COVID-19 analysis, derived from over 100,000,000 tweets collected since mid-January 2020.
☆57 Updated 3 years ago
Alternatives and similar repositories for covid19_unique_tweets:
Users who are interested in covid19_unique_tweets are comparing it to the repositories listed below.
- Interpretable data visualizations for understanding how texts differ at the word level ☆274 Updated last week
- Cleans Reddit Text Data ☆81 Updated 4 years ago
- Topic Inference with Zeroshot models ☆61 Updated last year
- A repository to house model building experiments and tools that are part of the Conversation AI effort. ☆139 Updated 3 weeks ago
- Code for the CUP Elements on text analysis in Python for social scientists ☆136 Updated 2 years ago
- The world's largest social media toxicity dataset. ☆177 Updated 2 years ago
- Tutorial for using twarc, with steps for installing software. ☆25 Updated 7 years ago
- Code for obtaining the Curation Corpus abstractive text summarisation dataset ☆125 Updated 4 years ago
- Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖 ☆182 Updated 2 years ago
- Spacy NER annotator using ipywidgets ☆119 Updated 10 months ago
- Getting recommendations from natural language ☆123 Updated 4 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims. ☆53 Updated 3 years ago
- spaCy pipeline object for negating concepts in text ☆279 Updated 8 months ago
- A python package to enrich Twitter Data ☆74 Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆104 Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds ☆156 Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆108 Updated last year
- Experiments to help discussion on Wikipedia talk pages ☆66 Updated 3 months ago
- Sentence transformers models for SpaCy ☆107 Updated last year
- A Python module for clustering creators of social media content into networks ☆74 Updated 3 years ago
- Text analysis with networks. ☆286 Updated 9 months ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 Updated 2 years ago
- ☆164 Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus. ☆42 Updated last year
- Google USE (Universal Sentence Encoder) for spaCy ☆182 Updated last year
- Social Media Mining Toolkit (SMMT) main repository ☆134 Updated 2 years ago
- A set of utility scripts to process Wikipedia related data ☆37 Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆91 Updated 3 years ago
- Pushshift Telegram Ingest ☆85 Updated 5 years ago
- Pretrained BERT model for analysing COVID-19 Twitter data ☆184 Updated last year