osdg-ai / osdg-data
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆30Updated last year
Alternatives and similar repositories for osdg-data:
Users that are interested in osdg-data are comparing it to the libraries listed below
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆38Updated 2 years ago
- ☆16Updated 4 years ago
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 3 years ago
- Code for the Master Thesis "Enhancing the Microsoft Academic Knowledge Graph"☆14Updated 4 years ago
- ☆54Updated last year
- Full text geoparsing/toponym resolution with event geolocation☆74Updated last month
- Helpers for our open data☆7Updated 4 months ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 7 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- The Open Jobs Observatory public mirror repo☆21Updated last year
- A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)☆27Updated last year
- Python package for text mining of time-series data☆71Updated 4 months ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆90Updated last year
- DashMap is an open source web platform that gathers, analyses and visualises urban data.☆45Updated 2 years ago
- ☆17Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 6 months ago
- A BERT-based application for reusable text classification at scale☆38Updated last year
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆20Updated 5 years ago
- Blazing fast topic modelling for short texts.☆31Updated 2 months ago
- A collection of notebooks for Natural Language Processing☆25Updated 2 months ago
- Political Discourse Analysis Using Pre-Trained Word Vectors.☆22Updated 2 years ago
- Measuring Sustainability Reporting using Web Scraping and Natural Language Processing☆36Updated 7 years ago
- A python package to enrich Twitter Data☆75Updated last year
- Set of scripts to aid in the download of the GDELT data files from gdelt.utdallas.edu☆16Updated 10 years ago
- Wellcome tool to parse references scraped from policy documents using machine learning☆25Updated 3 years ago
- A light-weight wrapper for the Datawrapper API.☆63Updated 8 months ago
- Fast, flexible name matching for large datasets☆71Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- ☆22Updated 4 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆27Updated last year