r-spark / sparkwarc
Load WARC files into Apache Spark with sparklyr
☆12 · Updated 3 years ago
Alternatives and similar repositories for sparkwarc
Users who are interested in sparkwarc are comparing it to the libraries listed below.
- Collection of statistical and other for work at the Wikimedia Foundation ☆9 · Updated 6 years ago
- Easily Install and Load Modern Web-Scraping Packages ☆49 · Updated 7 years ago
- An R package for out-of-core regressions ☆14 · Updated 7 years ago
- Make a DocumentTermMatrix faster ☆21 · Updated last year
- An R client implementing W3C WebDriver ☆53 · Updated 8 years ago
- 📰🗞 New York Times data ☆12 · Updated 6 years ago
- ARCHIVED ☆36 · Updated 3 years ago
- Presentation of {polite} at UseR'19 in Toulouse (11 July 2019) ☆17 · Updated 5 years ago
- R Wrapper for Google's Compact Language Detector 2 ☆38 · Updated 2 months ago
- Access to the stringi API from within an Rcpp-based Project ☆11 · Updated 4 months ago
- R Package For Accessing Docker via Docker APIs ☆20 · Updated 7 years ago
- Quick tidytext examination of Mueller vs. Watergate reports ☆16 · Updated 6 years ago
- U.S. House and Senate Voting Cartogram Generators in R ☆42 · Updated 2 years ago
- R Installer Package for Pre-Built X-13ARIMA-SEATS Binaries ☆11 · Updated 11 months ago
- Small R package for no-API-required URL expansion ☆32 · Updated 5 years ago
- Make R and your Mac speak ☆15 · Updated 3 years ago
- An R package that returns tidy data from the World Prison Brief website. ☆17 · Updated 4 years ago
- Demonstrate the process of improving BoM heatmaps ☆17 · Updated 8 years ago
- An R package for reading from and writing to a PostgreSQL database ☆14 · Updated 5 years ago
- Build a social network dashboard in R (Twitter/Facebook/GitHub/etc...) ☆14 · Updated 8 years ago
- R bindings to Apache Arrow ☆31 · Updated 6 years ago
- Create "Object Tables" with R Functions ☆21 · Updated 3 years ago
- Tools for reshaping text data ☆52 · Updated last year
- A collection of thoughts and methods for text mining in R ☆21 · Updated 6 years ago
- 🔤 Lightweight R package for manipulating [string] characters ☆17 · Updated 6 years ago
- 🕸🧰☕️ Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library ☆36 · Updated 2 months ago
- Year in Review with R Rmd Template ☆34 · Updated 7 years ago
- ⌚️ A faster unique() function ☆19 · Updated 6 years ago
- This package has functions which will generate a data frame of US zip codes when given a starting zip code and a radius in miles as well … ☆12 · Updated 5 years ago
- Official repo for: ☆13 · Updated 2 months ago