Quartz/aistudio-searching-data-dumps-with-use

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Quartz/aistudio-searching-data-dumps-with-use)

Quartz / aistudio-searching-data-dumps-with-use

searching large heterogenous data dumps with Universal Sentence Encoder

☆64

Alternatives and similar repositories for aistudio-searching-data-dumps-with-use

Users that are interested in aistudio-searching-data-dumps-with-use are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

occrp / COVID-19-spending-2020
View on GitHub
OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020
☆14Nov 26, 2020Updated 5 years ago
ANCIR / mozambique
View on GitHub
Who are the people behind the mining industry in Mozambique? A partial answer can be found by connecting minerals concessions to the peop…
☆25Jul 30, 2015Updated 10 years ago
thecarebot / carebot-tracker
View on GitHub
carebot-tracker.js — Carebot's tracking component for Google Analytics events
☆17Apr 19, 2016Updated 10 years ago
The-Politico / gspan.js
View on GitHub
Parses Google Documents formatted for annotated transcripts –– with JavaScript
☆18Feb 14, 2022Updated 4 years ago
mysociety / bluetail
View on GitHub
An alpha project combining beneficial ownership and contracting data
☆13Jun 9, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
jonkeegan / nicar20-radio
View on GitHub
Notes for my talk "Exploring the Radio Spectrum for News"
☆13Mar 6, 2020Updated 6 years ago
occrp-attic / datacommons
View on GitHub
A fleet of Memorious scrapers for crawling various open data sources
☆15Sep 24, 2020Updated 5 years ago
associatedpress / harvester
View on GitHub
Collaborative data collection tool developed by the Associated Press
☆109Feb 24, 2023Updated 3 years ago
The-Politico / politico-civic
View on GitHub
POLITICO's system for managing civic data
☆20Dec 7, 2022Updated 3 years ago
opensanctions / offshore-graph
View on GitHub
Loading OpenSanctions into Neo4J and Linkurious
☆32Dec 17, 2024Updated last year
CivOmega / civomega
View on GitHub
Ask questions about government data.
☆38Jan 17, 2019Updated 7 years ago
rdmurphy / quaff
View on GitHub
A data pipeline helper written in node to convert a folder of JS/ArchieML/JSON/YAML/CSV/TSV files into usable data.
☆47Sep 4, 2023Updated 2 years ago
newsdev / stevedore
View on GitHub
search document dumps: ingest and explore in one extensible framework
☆123Jun 22, 2020Updated 6 years ago
dannguyen / nicar-2019-pdfplumbing
View on GitHub
NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs
☆12Mar 9, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
bettergov / foiamail
View on GitHub
yet another foia automation service
☆44Jul 6, 2022Updated 4 years ago
Sigma-Awards / The-Sigma-Awards-projects-data
View on GitHub
This is the repository of all projects data submitted to The Sigma Awards.
☆18May 11, 2026Updated 2 months ago
datadesk / django-bigbuild
View on GitHub
The open-source engine that powers bigbuilder, the Los Angeles Times Data Desk's system for publishing standalone pages
☆24Mar 30, 2020Updated 6 years ago
guardian / giant
View on GitHub
Platform for journalists to search, analyse, categorise and share unstructured data
☆59Jul 22, 2026Updated last week
campagnucci / dados-abertos-gov-br
View on GitHub
Tratamento e análise de dados abertos do Governo Federal do Brasil
☆10Oct 26, 2022Updated 3 years ago
Quartz / aistudio-doc2vec-for-investigative-journalism
View on GitHub
How Quartz used AI to help reporters search the Mauritius Leaks
☆49Aug 13, 2019Updated 6 years ago
newsdev / elex-loader
View on GitHub
The NYT AP election loader scripts
☆22Nov 7, 2017Updated 8 years ago
maxharlow / csvmatch
View on GitHub
🔎 Finds fuzzy matches between CSV files
☆189Mar 26, 2025Updated last year
anthonydb / data-wrangling-python-nicar-2017
View on GitHub
Materials for the NICAR 2017 Data Wrangling with Python hands-on class
☆14Mar 4, 2017Updated 9 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jsvine / data-tactics
View on GitHub
Half-baked idea: Conceptual building blocks for data analysis.
☆12May 7, 2015Updated 11 years ago
newsdev / apfake
View on GitHub
A command-line tool for generating AP API JSON files for testing elections applications.
☆15Jul 5, 2022Updated 4 years ago
datamade / dossier
View on GitHub
Machine assisted dossiers
☆19Oct 12, 2017Updated 8 years ago
ireapps / install-guides
View on GitHub
Install guides for IRE/NICAR conferences.
☆16Mar 16, 2018Updated 8 years ago
The-Politico / generator-politico-graphics
View on GitHub
☆10Mar 10, 2019Updated 7 years ago
fer-aguirre / pmdm
View on GitHub
Political Misogynistic Discourse Monitor team from the 2021 JournalismAI Collab Challenges
☆21Jul 25, 2023Updated 3 years ago
robbarry / nicar19-internetwar
View on GitHub
☆19Mar 20, 2019Updated 7 years ago
The-Politico / gootenberg
View on GitHub
A tool for handling news developer needs from the Google API.
☆38Jan 3, 2023Updated 3 years ago
dataresearchcenter / investigraph
View on GitHub
etl pipeline, graphical explorer and general toolbox for investigations with follow the money data
☆28Jul 15, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
alephdata / ingest-file
View on GitHub
Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
☆68Dec 19, 2025Updated 7 months ago
alexbyrnes / Datapiece
View on GitHub
Investigative tool for extracting relevant areas from many documents
☆14Nov 17, 2015Updated 10 years ago
cjdd3b / nicar2013
View on GitHub
Various documents and code examples for NICAR 2013 presentations.
☆38Mar 1, 2013Updated 13 years ago
jsvine / nbexec
View on GitHub
A simple tool for executing Jupyter notebooks from the command line.
☆22Feb 11, 2023Updated 3 years ago
DallasMorningNews / chartwerk-editor
View on GitHub
React/Redux Chartwerk editor.
☆10Oct 5, 2018Updated 7 years ago
washingtonpost / 2020-election-night-model
View on GitHub
2020-election-night-model
☆60Jan 4, 2021Updated 5 years ago
18F / formsgov-demo
View on GitHub
☆10Dec 8, 2021Updated 4 years ago