slyrz / warcLinks
Read and write WARC files in Go
☆46Updated 7 years ago
Alternatives and similar repositories for warc
Users that are interested in warc are comparing it to the libraries listed below
Sorting:
- A golang library to work with WARC files from the common crawl☆15Updated 7 years ago
- golang readers for ARC and WARC webarchive formats☆20Updated 2 years ago
- Package mbox parses the mbox file format into messages and formats messages into mbox files☆74Updated 4 months ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 5 years ago
- An iCalendar library for Go☆65Updated 4 months ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.☆85Updated last week
- Serve millions of JSON documents via HTTP.☆70Updated 11 months ago
- A Reader/ReaderAt for Go that uses Range requests to get files over HTTP☆28Updated 3 years ago
- mediawiki dump parser for loading up wikipedia data☆107Updated last month
- Generic data structures in Go.☆24Updated last week
- A native golang implementation of cdb (http://cr.yp.to/cdb.html)☆68Updated 6 years ago
- Text summarizer for golang using LexRank☆134Updated last week
- Go XML Pull Parser☆34Updated 9 months ago
- ☆23Updated last month
- Parse JPEG data into segments via code or CLI from pure Go. Read/export/write EXIF data. Read XMP and IPTC metadata.☆78Updated 3 years ago
- Pure Go implementation of cryptographic APIs found in libsodium☆45Updated 5 years ago
- An approximate string matching library for the Go programming language.☆179Updated 2 years ago
- Fast integer map for uint32-to-uint32☆31Updated 3 months ago
- Self-organizing maps in Go☆74Updated 3 years ago
- A Go implementation of the Cassowary constraint solving algorithm.☆78Updated 5 years ago
- Minimal version of utls for parrotting the TLS handshake of popular web browsers☆25Updated 10 months ago
- Go HTTP RoundTripper that Solves the Cloudflare Challenge☆41Updated 6 years ago
- adding badger support to blevesearch☆63Updated 2 years ago
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library.☆113Updated 3 years ago
- Persist a Go object to a JSON file☆102Updated last year
- Generate Man pages from Go source☆159Updated 10 years ago
- Conflict-free replicated JSON implementation in native Go☆99Updated 4 years ago
- Go package for abstracting local, in-memory, and remote (Google Cloud Storage/S3) filesystems☆52Updated 7 years ago
- A golang phonetics algorithm library☆31Updated 10 years ago
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆121Updated 2 years ago