osdg-ai / osdg-data
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆37Updated 2 years ago
Alternatives and similar repositories for osdg-data
Users who are interested in osdg-data are comparing it to the repositories listed below.
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆45Updated 2 years ago
- Text analysis with networks.☆291Updated 2 weeks ago
- A Python client for the GDELT 2.0 Doc API☆164Updated 7 months ago
- ☆16Updated 4 years ago
- A deep learning system for demographic inference (gender, age, and individual/person) that was trained on a massive Twitter dataset using p…☆154Updated 2 years ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆83Updated last year
- Interpretable data visualizations for understanding how texts differ at the word level☆284Updated 9 months ago
- Full text geoparsing/toponym resolution with event geolocation☆81Updated last month
- Code for the CUP Elements on text analysis in Python for social scientists☆138Updated 3 years ago
- Data and code accompanying the Nature paper "Quantifying social organization and political polarization in online platforms"☆66Updated 3 years ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 7 years ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆100Updated 2 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆147Updated last year
- ☆24Updated 4 years ago
- Python-based framework to retrieve Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.☆237Updated 2 years ago
- ☆55Updated 2 years ago
- Text and statistics utilities from Pew Research Center☆85Updated 3 years ago
- A very simple library for exploiting graph-of-words in NLP☆12Updated 4 years ago
- An affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.☆75Updated 2 years ago
- A tool to assign Sustainable Development Goals to a scientific abstract☆17Updated 4 years ago
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆64Updated 4 years ago
- This page is meant to provide current research updates on Polarization and Echo-chambers on Social Media. Unlike other survey pages, this…☆45Updated 2 years ago
- This repository contains the underlying code for the paper "Monitoring Global Development Aid with Machine Learning" by Toetzke, M.; N. B…☆13Updated 3 years ago
- A file that contains the schema for GDELT 2.0 Header rows for the Events Database.☆50Updated 7 years ago
- A Python package to enrich Twitter data☆75Updated 2 years ago
- Handy Jupyter Notebooks that I use for topic modeling, including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆42Updated 6 years ago
- This package consists of functionalities for dynamic topic modelling and its visualization☆26Updated 5 years ago
- Dynamic topic modeling☆42Updated 2 years ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆35Updated 4 years ago
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 4 years ago