osdg-ai / osdg-dataLinks
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆37Updated 2 years ago
Alternatives and similar repositories for osdg-data
Users that are interested in osdg-data are comparing it to the libraries listed below
Sorting:
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆44Updated 2 years ago
- Code for the Master Thesis "Enhancing the Microsoft Academic Knowledge Graph"☆14Updated 5 years ago
- This repository contains the underlying code for the paper "Monitoring Global Development Aid with Machine Learning" by Toetzke, M.; N. B…☆13Updated 3 years ago
- A deep learning system for demographic inference (gender, age, and individual/person) that was trained on massive Twitter dataset using p…☆156Updated 2 years ago
- A Python client for the GDELT 2.0 Doc API☆178Updated 9 months ago
- Text and statistics utilities from Pew Research Center☆86Updated 3 years ago
- Python based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.☆241Updated 2 years ago
- Text analysis with networks.☆292Updated this week
- Fuzzy matches and merging of datasets in pandas using csvmatch☆77Updated 5 years ago
- ☆16Updated 4 years ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆83Updated 2 years ago
- A very simple library for exploiting graph-of-words in NLP☆12Updated 4 years ago
- Code for the CUP Elements on text analysis in Python for social scientists☆139Updated 3 years ago
- A tool to assign Sustainable Development Goals to a scientific abstract☆17Updated 4 years ago
- How are words loaded with meaning? Repository accompanying research by Alina Arseniev-Koehler and Jacob G. Foster, titled "Machine learn…☆42Updated 2 years ago
- Full text geoparsing/toponym resolution with event geolocation☆83Updated this week
- A python package to enrich Twitter Data☆75Updated 2 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆286Updated 11 months ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 7 years ago
- An affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.☆76Updated 3 years ago
- ☆24Updated 4 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆149Updated last year
- The FBAdLibrarian is a simple tool that can pull ad data and collects images offered by Facebook’s Ad Library API.☆16Updated 2 years ago
- ☆55Updated 2 years ago
- Source code and data for paper "Neutral Bots Probe Political Bias on Social Media" by Chen et al.☆31Updated 3 years ago
- Data and code accompanying the Nature paper "Quantifying social organization and political polarization in online platforms"☆66Updated 3 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆36Updated 2 years ago
- Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development☆480Updated 2 years ago
- A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)☆28Updated 6 months ago
- Turn Tweet IDs into Twitter JSON & CSV from your desktop!☆436Updated 2 years ago