datasciencecampus / pygrams
Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
☆63 · Updated last year
Related projects
Alternatives and complementary repositories for pygrams
- A maximum-strength name parser for record linkage. ☆34 · Updated 3 months ago
- Turning news into events since 2014. ☆50 · Updated 7 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆15 · Updated 4 years ago
- A browser user interface for manual labeling of record pairs. ☆41 · Updated last year
- Extract networks of entities from journalistic reporting ☆47 · Updated last year
- Fast, flexible name matching for large datasets ☆70 · Updated 11 months ago
- Making Patent Citations Uncool Again ☆108 · Updated last year
- Data Server for Topic Models ☆121 · Updated last year
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆145 · Updated 9 months ago
- Next generation event data ontology ☆69 · Updated 9 months ago
- Fuzzy matches and merging of datasets in pandas using csvmatch ☆74 · Updated 4 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 · Updated 7 years ago
- An open interface to GDELT APIs ☆41 · Updated 11 months ago
- Downloader, preprocessor, parser and deduper for NIH and NSF grants ☆20 · Updated 6 years ago
- ☆16 · Updated 6 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated last year
- Scrapes the web. Gets the news. ☆13 · Updated 8 years ago
- Visual analytics application for qualitative text analysis ☆24 · Updated last year
- ☆71 · Updated this week
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job. ☆73 · Updated 4 months ago
- Predict age and gender from a first name ☆60 · Updated 6 years ago
- Another next-generation event coding platform. ☆71 · Updated 5 years ago
- Package for performing Reddit-based text analysis ☆20 · Updated 5 years ago
- Political Discourse Analysis Using Pre-Trained Word Vectors. ☆22 · Updated last year
- A set of jupyter notebooks demonstrating how to use the Media Cloud API. ☆34 · Updated 11 months ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more. ☆105 · Updated last week
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated last month
- Python wrapper for a C++ Double Metaphone ☆15 · Updated last year
- Python module for bibliographic network analysis. ☆84 · Updated 4 years ago