datasciencecampus / pygrams
Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
☆62 Updated last year
Alternatives and similar repositories for pygrams:
Users who are interested in pygrams are comparing it to the libraries listed below.
- Turning news into events since 2014. ☆51 Updated 7 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 Updated 7 years ago
- Political Discourse Analysis Using Pre-Trained Word Vectors. ☆22 Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 Updated 8 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆14 Updated 4 years ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more. ☆107 Updated 3 months ago
- Fuzzy matches and merging of datasets in pandas using csvmatch ☆75 Updated 4 years ago
- Next generation event data ontology ☆72 Updated last year
- An open interface to GDELT APIs ☆45 Updated last year
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 Updated last year
- Visual analytics application for qualitative text analysis ☆24 Updated 2 years ago
- Python package aiding in entity disambiguation based on string and location matching ☆18 Updated last year
- Package for performing Reddit-based text analysis ☆20 Updated 6 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Python wrapper for the US Census Geocoder ☆73 Updated 9 months ago
- Downloader, preprocessor, parser and deduper for NIH and NSF grants ☆20 Updated 6 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics. ☆78 Updated last year
- ☆16 Updated 6 years ago
- Another next-generation event coding platform. ☆73 Updated 5 years ago
- Predict age and gender from a first name ☆60 Updated 6 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 Updated this week
- Various functions to make bag-of-words approaches to text analysis more user-friendly ☆24 Updated 7 years ago
- Language-agnostic political event coding using universal dependencies ☆18 Updated 5 years ago
- Making Patent Citations Uncool Again ☆110 Updated last year
- Python based Wikidata framework for easy dataframe extraction ☆42 Updated last year
- Python library providing sentiment lexicons. ☆26 Updated 8 years ago
- Extract networks of entities from journalistic reporting ☆48 Updated last year
- modification of bibliotools 2.2 from Sébastian Grauwin ☆11 Updated 5 years ago
- Introduction to Topic Modeling for TextXD 2019, 12/3/2019 ☆10 Updated 5 years ago
- Examples for getting started using https://case.law ☆65 Updated 2 years ago