osdg-ai / osdg-dataLinks
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆37Updated 2 years ago
Alternatives and similar repositories for osdg-data
Users that are interested in osdg-data are comparing it to the libraries listed below
Sorting:
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆44Updated 2 years ago
- ☆16Updated 4 years ago
- Text analysis with networks.☆291Updated last month
- Full text geoparsing/toponym resolution with event geolocation☆81Updated last week
- Code for the Master Thesis "Enhancing the Microsoft Academic Knowledge Graph"☆14Updated 5 years ago
- Code for the CUP Elements on text analysis in Python for social scientists☆138Updated 3 years ago
- A Python client for the GDELT 2.0 Doc API☆169Updated 7 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆149Updated last year
- The dataset used to evaluate JobBERT on the task of job title normalization.☆27Updated 3 years ago
- A deep learning system for demographic inference (gender, age, and individual/person) that was trained on massive Twitter dataset using p…☆154Updated 2 years ago
- A very simple library for exploiting graph-of-words in NLP☆12Updated 4 years ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆35Updated 4 years ago
- Python based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.☆239Updated 2 years ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆101Updated 3 months ago
- This repository contains the underlying code for the paper "Monitoring Global Development Aid with Machine Learning" by Toetzke, M.; N. B…☆13Updated 3 years ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆160Updated 2 weeks ago
- Interpretable data visualizations for understanding how texts differ at the word level☆285Updated 10 months ago
- ☆24Updated 4 years ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆83Updated 2 years ago
- Blazing fast topic modelling for short texts.☆34Updated 2 months ago
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆64Updated 4 years ago
- Using stochastic block models for topic modeling☆196Updated last year
- The FBAdLibrarian is a simple tool that can pull ad data and collects images offered by Facebook’s Ad Library API.☆16Updated 2 years ago
- This page is meant to provide current research updates on Polarization and Echo-chambers on Social Media. Unlike other survey pages, this…☆45Updated 2 years ago
- Text and statistics utilities from Pew Research Center☆85Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Updated 3 years ago
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆37Updated 2 years ago
- Data and code accompanying the Nature paper "Quantifying social organization and political polarization in online platforms"☆66Updated 3 years ago
- The Python crash course of the Summer Institute in Computational Social Science 2022!☆10Updated 3 years ago