pudo-attic / extractors
Re-usable wrapper scripts for text document extractors.
☆37Jun 18, 2016Updated 9 years ago
Alternatives and similar repositories for extractors
Users that are interested in extractors are comparing it to the libraries listed below
- A reusable, stand-alone version of the LittleSis network storytelling tool☆12Jan 30, 2016Updated 10 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆13Dec 27, 2020Updated 5 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 10 years ago
- A simple platform for managing structured data.☆28Feb 28, 2022Updated 3 years ago
- Little JSON objects want to be graphs, too!☆17Oct 2, 2015Updated 10 years ago
- Archive of political ad data from the Federal Communications Commission☆20Oct 25, 2017Updated 8 years ago
- Who are the people behind the mining industry in Mozambique? A partial answer can be found by connecting minerals concessions to the peop…☆25Jul 30, 2015Updated 10 years ago
- A contextual news development environment.☆49Dec 19, 2014Updated 11 years ago
- Make workflow for downloading Census geodata and joining it to survey data☆37Dec 6, 2021Updated 4 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data source mentions that keyword.☆24Aug 25, 2013Updated 12 years ago
- A scraper to identify whether a person is of interest against key databases.☆21Apr 17, 2019Updated 6 years ago
- Scraper built with Scrapy.☆18Aug 14, 2024Updated last year
- Track the keyword positions☆19Oct 26, 2013Updated 12 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents☆10Aug 9, 2015Updated 10 years ago
- List of Sanctions and Most wanted☆29Jun 9, 2017Updated 8 years ago
- Newsclipse: The IDE for news production.☆91Dec 11, 2014Updated 11 years ago
- A dashboard with insights into Mexico's procurement performance☆12Jul 17, 2020Updated 5 years ago
- A command-line and programmatic interface to various social sharecount endpoints.☆30Nov 18, 2018Updated 7 years ago
- A repo of class materials for NICAR16☆12Mar 12, 2016Updated 9 years ago
- A fork of telescope, a SPARQL query building library for Python☆11Nov 29, 2017Updated 8 years ago
- Siyazana is an isiZulu word that means we know each other or we are connected. This website has been designed to provide users with a too…☆15Sep 22, 2018Updated 7 years ago
- Dexter document monitor for MMA☆16May 8, 2024Updated last year
- Write Claim Reviews for Tweets☆11Dec 24, 2019Updated 6 years ago
- Facet management for backendless datascapes☆12Jan 21, 2016Updated 10 years ago
- Collaborative Innovation Class Project☆14Jun 12, 2015Updated 10 years ago
- Using social media to steer web archiving and curation.☆18Nov 20, 2015Updated 10 years ago
- A command line and Python client for Open-Spending☆10Nov 24, 2017Updated 8 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Mar 21, 2019Updated 6 years ago
- A cross-platform command line tool for parallelised content extraction and analysis.☆252Jan 21, 2026Updated 3 weeks ago
- For watching a set of URLs and notifying someone when something has changed.☆32Jun 12, 2017Updated 8 years ago
- Investigative tool for extracting relevant areas from many documents☆14Nov 17, 2015Updated 10 years ago
- JSON schemas for OpenCorporates data☆21Jan 20, 2026Updated 3 weeks ago
- A Python library that standardizes the names of U.S. states☆25Mar 24, 2015Updated 10 years ago
- ☆24Mar 9, 2016Updated 9 years ago
- ☆23Mar 7, 2015Updated 10 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Sep 14, 2016Updated 9 years ago
- A visual timeline authoring tool that extracts temporal information from freeform text☆65Jul 25, 2023Updated 2 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Jul 8, 2015Updated 10 years ago