wolfgangmeyers / go-warcLinks
A golang library to work with WARC files from the common crawl
☆15Updated 7 years ago
Alternatives and similar repositories for go-warc
Users that are interested in go-warc are comparing it to the libraries listed below
Sorting:
- Read and write WARC files in Go☆46Updated 7 years ago
- golang readers for ARC and WARC webarchive formats☆20Updated 2 years ago
- The speed of a native map, the safety of sync.RWMutex and the durability of bbolt☆24Updated 5 years ago
- Serve millions of JSON documents via HTTP.☆70Updated 10 months ago
- pure golang spelling based on hunspell dictionaries☆41Updated 9 years ago
- A pure Go implementation of the smaz compression library for short strings.☆20Updated 9 years ago
- Tokenizers and lemmatizers for Go☆110Updated 2 weeks ago
- DNS client & server package for Go☆42Updated 6 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Updated 7 years ago
- Space efficient streaming quantile estimator☆21Updated 2 years ago
- Package ara provides a dialer with customizable resolver.☆16Updated 4 months ago
- A golang phonetics algorithm library☆31Updated 10 years ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy☆73Updated 10 years ago
- adding badger support to blevesearch☆63Updated 2 years ago
- Convert a nested map to a flat map of sql-friendly columns☆16Updated last year
- Go package for abstracting local, in-memory, and remote (Google Cloud Storage/S3) filesystems☆52Updated 7 years ago
- Document Indexing and Searching Library in Go☆19Updated 5 years ago
- Command deadleaves finds and prints the import paths of unused Go packages.☆34Updated 9 years ago
- ☆26Updated last year
- Increasing bleve indexing performance with sharding☆20Updated 7 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 5 years ago
- Pure Go implementation of cryptographic APIs found in libsodium☆45Updated 4 years ago
- Latent Dirichlet Allocation☆31Updated 3 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text☆72Updated 9 months ago
- Small wrapper around golang.org/x/crypto/openpgp☆22Updated 8 years ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.☆68Updated this week
- A Golang library for dumping SQL text☆41Updated 2 years ago
- Take screenshot of a web page☆21Updated 8 years ago
- Simple Go library for executing lots of operations spread over any number of threads☆75Updated 2 years ago
- Miscellaneous tools for processing WARC files from the CommonCrawl☆24Updated 11 years ago