delvinso / covid19_unique_tweetsLinks
An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000 tweets collected since mid-January 2020.
β59Updated 3 years ago
Alternatives and similar repositories for covid19_unique_tweets
Users that are interested in covid19_unique_tweets are comparing it to the libraries listed below
Sorting:
- Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers π¦ πβ184Updated 3 years ago
- Cleans Reddit Text Dataβ84Updated 5 years ago
- π Easy training and deployment of seq2seq models.β228Updated 4 years ago
- Getting recommendations from natural languageβ123Updated 5 years ago
- Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation.β226Updated 5 years ago
- Conversational text Analysis using various NLP techniquesβ182Updated 2 years ago
- Generate realistic Instagram captions using transformers π€β101Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text dataβ¦β243Updated last year
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classificationβ29Updated last year
- a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT modelsβ192Updated 5 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.β110Updated 2 years ago
- Topic Inference with Zeroshot modelsβ61Updated 2 years ago
- Quote extraction for modular journalism (JournalismAI collab 2021)β229Updated 3 years ago
- Social Media Mining Toolkit (SMMT) main repositoryβ136Updated 3 years ago
- Explainable Zero-Shot Topic Extractionβ65Updated last year
- ππ A browser extension that displays the GPT-2 Log Probability of selected textβ112Updated 2 years ago
- Question Generation - Question Answering for Automatic Flashcardsβ66Updated 3 years ago
- DRIFT is a tool for Diachronic Analysis of Scientific Literature.β126Updated 3 months ago
- Hate Speech Detection Library for Python.β194Updated 3 months ago
- Social Analysis based on Whatsapp dataβ149Updated 2 years ago
- AlBERTo the first italian BERT model for Twitter languange understandingβ72Updated 5 years ago
- Minimal starting point for rapid prototyping interactive Human-AI toolsβ33Updated 3 years ago
- open datasets for sentiment analysis based on tweets in English/Spanish/French/German/Italianβ75Updated 2 years ago
- Catalog of abusive language data (PLoS 2020)β321Updated last year
- Code for obtaining the Curation Corpus abstractive text summarisation datasetβ128Updated 5 years ago
- The world's largest social media toxicity dataset.β189Updated 3 years ago
- No Teacher BART distillation experiment for NLI tasksβ28Updated 5 years ago
- Code and data for the paper, "Automatically Neutralizing Subjective Bias in Text"β198Updated last year
- A comprehensive tool for linguistic analysis of communitiesβ49Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β244Updated 2 years ago