r-spark / sparkwarc
Load WARC files into Apache Spark with sparklyr
☆13Updated 3 years ago
Alternatives and similar repositories for sparkwarc:
Users that are interested in sparkwarc are comparing it to the libraries listed below
- Easily Install and Load Modern Web-Scraping Packages☆50Updated 6 years ago
- Collection of statistical and other for work at the Wikimedia Foundation☆10Updated 5 years ago
- ARCHIVED☆37Updated 2 years ago
- Comparing fitting HMMs with R, RcppArmadillo, and TMB☆12Updated 7 years ago
- Quick tidytext examination of Mueller vs. Watergate reports☆17Updated 5 years ago
- R Wrapper for Google's Compact Language Detector 2☆38Updated 5 months ago
- Tools to Work with the Web Archive Ecosystem in R☆21Updated 7 years ago
- A sentiment analysis package for R.☆22Updated last year
- Access to the stringi API from within an Rcpp-based Project☆11Updated last month
- ARCHIVED Extract Text from 'PDFs'☆20Updated 2 years ago
- R Package For Accessing Docker via Docker APIs☆21Updated 7 years ago
- Quickly transform data.frames into onehot encoded matrices☆11Updated 5 years ago
- flag geom for ggplot2☆4Updated 7 years ago
- 📚🔍 All of my old gists in one place☆10Updated 6 years ago
- Small R package for no-API-required URL expansion☆33Updated 4 years ago
- The Mechanics of ggplot2☆16Updated 7 years ago
- R helpers for using Google Fonts☆18Updated 8 years ago
- https://tmastny.github.io/leadr/☆26Updated 6 years ago
- 📰🗞 New York Times data☆12Updated 6 years ago
- R interface to the Algorithmia API☆12Updated 6 years ago
- Build a social network dashboard in R (Twitter/Facebook/GitHub/etc...)☆14Updated 8 years ago
- Run R Scripts and Jobs with Pushbullet Alerts☆10Updated 5 years ago
- Tools to Retrieve Economic Policy Institute Data Library Extracts in R☆20Updated 4 years ago
- Make handling decision trees easy. Treezy.☆14Updated 5 years ago
- Set of fonts with permissive licences☆17Updated 6 years ago
- Create your own R Archive Network☆35Updated 6 years ago
- Tools to complement building and using R packages installed from GitHub☆25Updated 8 years ago
- Extension of xml2 package for xsl transformations☆29Updated 2 weeks ago
- Make a DocumentTermMatrix faster☆20Updated last year
- Demonstrate the process of improving BoM heatmaps☆19Updated 8 years ago