osdg-ai / osdg-data
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆27Updated 11 months ago
Related projects:
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆35Updated last year
- A tool to assign Sustainable Development Goals to a scientific abstract☆15Updated 3 years ago
- Measuring Sustainability Reporting using Web Scraping and Natural Language Processing☆35Updated 7 years ago
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 2 years ago
- This package consists of functionalities for dynamic topic modelling and its visualization☆24Updated 4 years ago
- Blazing fast topic modelling for short texts.☆28Updated 2 months ago
- A Python client for the GDELT 2.0 Doc API☆91Updated 6 months ago
- Code for the Master Thesis "Enhancing the Microsoft Academic Knowledge Graph"☆14Updated 3 years ago
- Full text geoparsing/toponym resolution with event geolocation☆70Updated last month
- A very simple library for exploiting graph-of-words in NLP☆12Updated 3 years ago
- Multi-Label Text Classification with Transfer Learning☆16Updated 4 years ago
- Material for course "Geospatial Analytics" (GSA), master degree in Data Science and Business Analytics, University of Pisa☆23Updated 9 months ago
- Easy PDF to text to spaCy text extraction in Python.☆33Updated 11 months ago
- Handy Jupyter Notebooks that I use for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆40Updated 5 years ago
- A Python package to enrich Twitter Data☆73Updated last year
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆87Updated 7 months ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 6 years ago
- Analysis and experiments on the UN General Debate corpus☆37Updated 5 years ago
- An open interface to GDELT APIs☆40Updated 9 months ago
- Influence of fake news in Twitter during the 2016 US presidential election☆10Updated 3 years ago
- The Open Jobs Observatory public mirror repo☆20Updated last year
- dynamic topic modeling☆39Updated last year
- Text-based Named Entity Recognition data pipeline to annotate and geolocate disaster-related data points.☆11Updated last year
- Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringi…☆34Updated 2 years ago
- Repository for the files and documents used in the Smart Literature Review paper (Boye, Møller, 2019)☆19Updated 2 years ago
- Nesta's Skills Extractor Library☆118Updated last month
- A package to easily train Bert-like models for text classification☆14Updated 10 months ago
- NLP: An Application for Public Policy, PyCon Ireland 2018☆23Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆151Updated last year