Quartz / aistudio-searching-data-dumps-with-use
searching large heterogenous data dumps with Universal Sentence Encoder
☆62Updated 3 years ago
Alternatives and similar repositories for aistudio-searching-data-dumps-with-use:
Users that are interested in aistudio-searching-data-dumps-with-use are comparing it to the libraries listed below
- How Quartz used AI to help reporters search the Mauritius Leaks☆46Updated 5 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 5 months ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- ⚡️ Enriches data, adding columns based on lookups to online services☆22Updated 3 weeks ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆148Updated this week
- Interactive and searchable House staffer directory, based on House disbursement data.☆26Updated 10 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆53Updated 6 months ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆186Updated 3 years ago
- Extract networks of entities from journalistic reporting☆47Updated last year
- a general list of resources and articles for people interested in getting into data journalism☆16Updated last year
- Set of scripts to aid in the download of the GDELT data files from www.gdeltproject.org☆11Updated 10 years ago
- framework for scraping legislative/government data☆86Updated 4 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning.☆63Updated 11 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- Materials for Frontiers of Computational Journalism, Columbia Journalism School 2018☆11Updated 6 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆54Updated 10 years ago
- Reading legal authority for the last time☆34Updated 8 months ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 7 years ago
- ☆23Updated 9 years ago
- Front-end for the MediaCloud database☆16Updated 6 years ago
- Code for Newslynx App☆22Updated 9 years ago