archivesunleashed / twut
An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.
☆9 Updated 5 months ago
Alternatives and similar repositories for twut
Users who are interested in twut are comparing it to the libraries listed below.
- A simple catalog of Twitter ID Datasets ☆28 Updated 6 months ago
- A gathering of digital methods recipes for research, teaching and collaborations from across the Public Data Lab. ☆11 Updated last year
- Service for creating Twitter datasets for research and archiving. ☆26 Updated 2 years ago
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archive… ☆26 Updated 2 years ago
- Web Archives for Historical Research ☆13 Updated 7 years ago
- ☆14 Updated 8 years ago
- A Twitter data collection and appraisal application. ☆51 Updated 2 years ago
- A digital humanities operating system that runs on a USB disk. ☆31 Updated 7 years ago
- WASAPI data transfer APIs ☆44 Updated 3 years ago
- A collection of ipython/jupyter notebooks ☆16 Updated 6 years ago
- Web application for distributed compute analysis of Archive-It web archive collections. ☆18 Updated 2 months ago
- A LevelDB backed URL unshortening microservice written in JavaScript ☆31 Updated 2 years ago
- Humanities Data Curation Record ☆11 Updated 7 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t… ☆10 Updated 7 years ago
- Prototype SOLR-powered web archive exploration UI. ☆43 Updated 4 years ago
- A financial disclosure data extraction tool. ☆16 Updated last year
- Ask questions about government data. ☆37 Updated 6 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js ☆55 Updated 5 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 Updated 2 years ago
- OpenRefine for Social Science Data ☆25 Updated last week
- A tool for working with tweet archives. ☆15 Updated 2 years ago
- A PDF classifier ensemble with REST API service ☆23 Updated 4 years ago
- Internet Research Agency Facebook ads as structured data ☆22 Updated 5 years ago
- Docker image for the Archives Unleashed Toolkit ☆12 Updated 2 years ago
- The GitHub repository for the AI for Humanists Project ☆18 Updated last month
- Heritage Connector: Transforming text into data to extract meaning and make connections ☆24 Updated 2 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆16 Updated 5 years ago
- A Python tool to search for and remove duplicated files in messy datasets ☆16 Updated 5 months ago
- utility to fetch provenance information from Internet Archive's Wayback Machine ☆13 Updated 3 years ago
- Python for Humanities ☆13 Updated last week