slyrz / warcLinks
Read and write WARC files in Go
☆47Updated 7 years ago
Alternatives and similar repositories for warc
Users that are interested in warc are comparing it to the libraries listed below
Sorting:
- A golang library to work with WARC files from the common crawl☆15Updated 7 years ago
- golang readers for ARC and WARC webarchive formats☆20Updated 2 years ago
- Package mbox parses the mbox file format into messages and formats messages into mbox files☆74Updated 5 months ago
- A Reader/ReaderAt for Go that uses Range requests to get files over HTTP☆28Updated 3 years ago
- mediawiki dump parser for loading up wikipedia data☆108Updated last week
- An iCalendar library for Go☆67Updated 5 months ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 6 years ago
- Go implementation of the JWZ email threading algorithm☆31Updated 2 years ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.☆152Updated 2 years ago
- Text summarizer for golang using LexRank☆137Updated last month
- An approximate string matching library for the Go programming language.☆181Updated 3 years ago
- Self-organizing maps in Go☆74Updated 3 years ago
- Serve millions of JSON documents via HTTP.☆70Updated last year
- Parse JPEG data into segments via code or CLI from pure Go. Read/export/write EXIF data. Read XMP and IPTC metadata.☆78Updated 3 years ago
- The gangsta gangsta way to pull email☆109Updated 5 years ago
- Go library for parsing microformats☆73Updated 5 months ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.☆92Updated 2 weeks ago
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆122Updated 2 years ago
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library.☆114Updated 3 years ago
- A Go package that implements the JusText boilerplate removal algorithm☆110Updated 3 years ago
- Go XML Pull Parser☆34Updated 10 months ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy☆73Updated 10 years ago
- Conflict-free replicated JSON implementation in native Go☆99Updated 4 years ago
- A Go port of the Rapid Automatic Keyword Extraction algorithm (RAKE)☆122Updated 5 months ago
- A pure Go implementation of the smaz compression library for short strings.☆20Updated 9 years ago
- Webpage summary extractor using Facebook Open Graph and arc90's readability☆68Updated 6 years ago
- Command deadleaves finds and prints the import paths of unused Go packages.☆34Updated 9 years ago
- DKIM package for golang☆99Updated 9 months ago
- generate template FuncMap helpers to construct struct literals within a Go template☆39Updated 4 months ago
- Minimalist Go sessions with a secure cookie Store implementation☆26Updated last year