kbullaughey / warc-tools
Miscellaneous tools for processing WARC files from the CommonCrawl
☆24 Updated 11 years ago
Alternatives and similar repositories for warc-tools
Users who are interested in warc-tools are comparing it to the libraries listed below
- Chrome Automation Library using Google Chrome Remote Debugger API in Go ☆85 Updated 3 years ago
- Render screenshots of given urls on Linux using Xvfb, midori, ratpoison & ImageMagick ☆83 Updated 9 years ago
- Summarizes text ☆39 Updated 9 years ago
- [Go] FreeTree - generic binary-search-tree without any GC overhead ☆45 Updated 9 years ago
- interactive, configurable content-blocking proxy written in golang ☆95 Updated 9 years ago
- Cross-platform persistent and distributed web crawler ☆112 Updated 7 years ago
- A simple domain name server to tolerate typos in subdomains written in Go ☆52 Updated 9 years ago
- A Go library which determines the dominant colors in an image. ☆19 Updated 10 years ago
- Simple Go implementation of the Porter Stemmer algorithm with powerful features. ☆27 Updated 4 years ago
- Fast identification of character sequences in text or documents (multi-lingual) ☆18 Updated 9 years ago
- Online Change Detection Algorithm ☆53 Updated 5 years ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy ☆73 Updated 9 years ago
- Temporary (disposable/throwaway) email detection library in Go (Golang) ☆21 Updated 9 years ago
- Deal with CLI prompts in style ☆40 Updated 6 years ago
- Middleware decorator for go servers. ☆37 Updated 10 years ago
- simple tools for encryption from the command-line ☆24 Updated 9 years ago
- A probabilistic data structure service and storage ☆92 Updated 9 years ago
- ipLocator - a basic Geo-Ip Server ☆71 Updated 6 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features. ☆13 Updated 9 years ago
- Serve millions of JSON documents via HTTP. ☆70 Updated 8 months ago
- A pure Go implementation of security question style password recovery that preserves end-to-end cryptographic security. ☆29 Updated 10 years ago
- QR code printer for your terminal ☆10 Updated 4 years ago
- Lightweight key-value interface to a bunch of storage engines with middleware support, organized as a chain of operations; written in Go ☆129 Updated 7 years ago
- ramcache implements an in-memory key/value cache with expirations based on access and insertion times. ☆13 Updated 6 years ago
- A pure Go implementation of the smaz compression library for short strings. ☆20 Updated 9 years ago
- A generic oplog/replication system for microservices ☆110 Updated 10 months ago
- An API optimized for processing timeseries data. ☆35 Updated 4 years ago
- 💧 In memory dataset filtering ☆49 Updated 3 years ago
- 🏠 Explode one-line address strings using Golang ☆53 Updated 2 years ago
- Self-Hosted Event Analytics Service ☆77 Updated 9 years ago