medialab / hyphe
Website crawler with a built-in web interface for exploration and control
☆328 Updated 2 months ago
Related projects
Alternatives and complementary repositories for hyphe
- Twitter stream + search API grabber ☆104 Updated last year
- Social Feed Manager user interface application. ☆153 Updated 4 months ago
- A webmining CLI tool & library for python. ☆285 Updated 3 weeks ago
- Browser version of Hyphe (WIP) ☆29 Updated 3 weeks ago
- A cross-platform command line tool for parallelised content extraction and analysis. ☆241 Updated last month
- Digital Methods Initiative - Twitter Capture and Analysis Toolset ☆367 Updated this week
- Lightweight web scraping toolkit for documents and structured data. ☆309 Updated 9 months ago
- Data model and processing tools for investigative entity data ☆217 Updated last week
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources ☆199 Updated this week
- The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms. ☆257 Updated this week
- WARC and ARC indexing and discovery tools. ☆116 Updated 3 months ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework. ☆102 Updated 3 months ago
- A helper library full of URL-related heuristics. ☆63 Updated last month
- YTDT is a collection of simple tools for extracting data from the YouTube platform via the YouTube API v3. ☆121 Updated last month
- Extract networks of entities from journalistic reporting ☆47 Updated last year
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆145 Updated 9 months ago
- A collection of scripts that help with downloading data from the Facebook platform for research purposes ☆55 Updated 8 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed… ☆144 Updated last month
- Scrapes posts and comments from public Facebook pages. ☆107 Updated 5 years ago
- Actor Network Text Analyser ☆56 Updated 9 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js ☆55 Updated 4 years ago
- A simple script for using Google's Vision API that will possibly develop into an actual tool. ☆13 Updated 6 years ago
- A browser extension to collect social media data with. ☆169 Updated 3 weeks ago
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co… ☆82 Updated 2 years ago
- Annuaire des comptes Twitter des parlementaires ☆42 Updated last year
- French stopwords collection ☆94 Updated 4 years ago
- ☆20 Updated 3 years ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives. ☆137 Updated 8 months ago
- Command-line utility to help researchers collect video metadata from Youtube API ☆29 Updated 2 months ago
- A small command line tool and set of functions for studying coordination networks in Twitter and other social media data. ☆74 Updated 2 years ago