chicago-justice-project / article-taggingLinks
Natural Language Processing of Chicago news articles
☆52Updated last month
Alternatives and similar repositories for article-tagging
Users that are interested in article-tagging are comparing it to the libraries listed below
Sorting:
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆66Updated 2 weeks ago
- Using ML to extract campaign finance data from messy forms for journalism☆77Updated 3 years ago
- Quick tutorial on getting started with GDELT☆45Updated 9 years ago
- ScraperWiki Python library for scraping and saving data☆158Updated 2 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆80Updated last year
- Python script for matching a list of messy addresses against a gazetteer using dedupe.☆63Updated 5 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated last year
- Interactive and searchable House staffer directory, based on House disbursement data.☆29Updated last year
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆47Updated 9 years ago
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 6 years ago
- Get Census Data from the API for arbitrary areas☆46Updated 6 months ago
- ☆73Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 10 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- Scraping Assisted by Learning☆35Updated last month
- A Los Angeles Times analysis of serious assaults misclassified by LAPD☆63Updated 7 years ago
- Predict age and gender from a first name☆59Updated 7 years ago
- A library for extracting tables from PDF files☆92Updated 5 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆39Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- ⛏ a library for scraping unreliable pages☆211Updated last month
- ☆46Updated 2 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Extract dates from text☆65Updated 4 years ago
- Code supporting the dissertation "Agents in Conflict," George Mason University, 2016☆20Updated 9 years ago
- Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.☆112Updated 11 months ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Python client for the Center for Responsive Politics API at OpenSecrets.org.☆43Updated 5 years ago