osdg-ai / osdg-dataLinks
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
☆36Updated last year
Alternatives and similar repositories for osdg-data
Users that are interested in osdg-data are comparing it to the libraries listed below
Sorting:
- OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant …☆43Updated 2 years ago
- Text analysis with networks.☆288Updated 5 months ago
- Code for the Master Thesis "Enhancing the Microsoft Academic Knowledge Graph"☆14Updated 4 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆147Updated 11 months ago
- Fuzzy matches and merging of datasets in pandas using csvmatch☆75Updated 5 years ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆95Updated last week
- A very simple library for exploiting graph-of-words in NLP☆12Updated 4 years ago
- Python based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.☆233Updated last year
- Interpretable data visualizations for understanding how texts differ at the word level☆280Updated 7 months ago
- A deep learning system for demographic inference (gender, age, and individual/person) that was trained on massive Twitter dataset using p…☆153Updated 2 years ago
- Public repository for the research outputs of the Mapping Career Causeways project☆25Updated 4 years ago
- A Python client for the GDELT 2.0 Doc API☆148Updated 4 months ago
- ☆16Updated 4 years ago
- Full text geoparsing/toponym resolution with event geolocation☆78Updated this week
- Code for the CUP Elements on text analysis in Python for social scientists☆137Updated 3 years ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆82Updated last year
- Nesta's Skills Extractor Library☆141Updated 3 months ago
- This package consists of functionalities for dynamic topic modelling and its visualization☆26Updated 5 years ago
- Using stochastic block models for topic modeling☆196Updated last year
- Data and code accompanying the Nature paper "Quantifying social organization and political polarization in online platforms"☆65Updated 3 years ago
- Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book☆271Updated last year
- This page is meant to provide current research updates on Polarization and Echo-chambers on Social Media. Unlike other survey pages, this…☆44Updated 2 years ago
- Fast, flexible name matching for large datasets☆72Updated 2 weeks ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 7 years ago
- Science of Science☆181Updated 3 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆285Updated 3 years ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆164Updated 2 years ago
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 3 years ago
- Scripts used to make and evaluate OpenAlex's concept tagging model☆49Updated 2 years ago