kbullaughey / warc-toolsLinks
Miscellaneous tools for processing WARC files from the CommonCrawl
☆24Updated 11 years ago
Alternatives and similar repositories for warc-tools
Users that are interested in warc-tools are comparing it to the libraries listed below
Sorting:
- Chrome Automation Library using Google Chrome Remote Debugger API in Go☆85Updated 4 years ago
- Render screenshots of given urls on Linux using Xvfb, midori, ratpoison & ImageMagick☆83Updated 9 years ago
- Temporary (disposable/throwaway) email detection library in Go(Golang)☆21Updated 9 years ago
- A simple domain name server to tolerate typos in subdomains written in Go☆52Updated 9 years ago
- Simple Go implementation of the Porter Stemmer algorithm with powerful features.☆27Updated 4 years ago
- Summarizes text☆39Updated 10 years ago
- A robust framing and encryption layer for your Go network programs, based on CurveZMQ.☆36Updated 7 years ago
- interactive, configurable content-blocking proxy written in golang☆94Updated 9 years ago
- ipLocator - a basic Geo-Ip Server☆72Updated 7 years ago
- [Go] FreeTree - generic binary-search-tree without any GC overhead☆45Updated 9 years ago
- An API optimized for processing timeseries data.☆35Updated 4 years ago
- Detect malicious homoglyphs in Go source code☆47Updated 8 years ago
- Follow Twitter users based on keywords☆29Updated 8 years ago
- A multicore csv reader library in Go☆47Updated last year
- Lightweight key-value interface to a bunch of storage engines with middleware support, organized as a chain of operations; written in Go☆129Updated 7 years ago
- An efficient map from a range of keys to a single value☆54Updated 7 years ago
- Cross-platform persistent and distributed web crawler☆112Updated 8 years ago
- A Go library which determines the dominant colors in an image.☆19Updated 10 years ago
- Golang Spellcheck based on "How to Write a Spelling Corrector"☆22Updated 10 years ago
- Package atomicfile provides an atomically written/replaced file.☆51Updated 7 years ago
- A go package to parse human-readble date and time strings☆54Updated 6 years ago
- Genex package for Go☆76Updated 5 years ago
- simhash storage and searching☆138Updated 8 years ago
- Middleware decorator for go servers.☆37Updated 10 years ago
- QR code printer for your terminal☆10Updated 4 years ago
- Online Change Detection Algorithm☆54Updated 5 years ago
- A pure Go implementation of security question style password recovery that preserves end-to-end cryptographic security.☆29Updated 10 years ago
- Agree is a Go package that makes it trivial to replicate any data structure using Raft.☆27Updated 9 years ago
- 🏠 Explode one-line address strings using Golang☆53Updated 3 years ago
- A probabilistic data structure service and storage☆92Updated 9 years ago