osdg-ai / osdg-data
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆28Updated last year
Alternatives and similar repositories for osdg-data:
Users that are interested in osdg-data are comparing it to the libraries listed below
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆36Updated last year
- ☆16Updated 3 years ago
- Code for the Master Thesis "Enhancing the Microsoft Academic Knowledge Graph"☆14Updated 4 years ago
- Full text geoparsing/toponym resolution with event geolocation☆71Updated 5 months ago
- A python package to enrich Twitter Data☆74Updated last year
- Python package for text mining of time-series data☆68Updated last month
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 3 years ago
- A very simple library for exploiting graph-of-words in NLP☆12Updated 3 years ago
- Multi-Label Text Classification with Transfer Learning☆17Updated 4 years ago
- Code for the CUP Elements on text analysis in Python for social scientists☆135Updated 2 years ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 6 years ago
- A Python client for the GDELT 2.0 Doc API☆109Updated 9 months ago
- ☆31Updated this week
- Easy PDF to text to spaCy text extraction in Python.☆38Updated 3 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆139Updated 3 months ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- A light-weight wrapper for the Datawrapper API.☆61Updated 6 months ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆91Updated 11 months ago
- Fast, flexible name matching for large datasets☆70Updated last year
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- The repository for the Sustainable Development Goals Interface Ontology☆64Updated 4 years ago
- An EUR-Lex parser for Python.☆29Updated 6 months ago
- ☆54Updated last year
- Text analysis with networks.☆286Updated 8 months ago
- This repository contains the underlying code for the paper "Monitoring Global Development Aid with Machine Learning" by Toetzke, M.; N. B…☆12Updated 2 years ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆72Updated last year
- Measuring Sustainability Reporting using Web Scraping and Natural Language Processing☆35Updated 7 years ago
- Text based Named Entity Recognition data pipeline to annotate, and geo locate disaster related data points.☆11Updated last year
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆20Updated 5 years ago
- A Python tool to pull the complete edit history of a Wikipedia page☆20Updated last month