wolfgangmeyers / go-warcLinks
A golang library to work with WARC files from the common crawl
☆15Updated 7 years ago
Alternatives and similar repositories for go-warc
Users that are interested in go-warc are comparing it to the libraries listed below
Sorting:
- Read and write WARC files in Go☆47Updated 7 years ago
- golang readers for ARC and WARC webarchive formats☆20Updated 2 years ago
- The speed of a native map, the safety of sync.RWMutex and the durability of bbolt☆24Updated 5 years ago
- Serve millions of JSON documents via HTTP.☆70Updated last year
- adding badger support to blevesearch☆63Updated 2 years ago
- Tokenizers and lemmatizers for Go☆113Updated 4 months ago
- A pure Go implementation of the smaz compression library for short strings.☆20Updated 10 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 6 years ago
- DNS client & server package for Go☆42Updated 6 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Updated 8 years ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.☆96Updated this week
- A golang phonetics algorithm library☆31Updated 11 years ago
- Miscellaneous tools for processing WARC files from the CommonCrawl☆25Updated 12 years ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy☆73Updated 10 years ago
- Go package for abstracting local, in-memory, and remote (Google Cloud Storage/S3) filesystems☆52Updated 7 years ago
- This package helps to work with huge amount of data, which cannot be stored in RAM☆43Updated 3 years ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.☆152Updated 2 years ago
- Roaring Bitmaps - compressed bitmaps in Go☆48Updated 11 years ago
- A Go implementation of the WordNet API☆39Updated 6 years ago
- Decode embedded EXIF meta data from image files written in Pure Golang☆33Updated 2 years ago
- Simple Go library for executing lots of operations spread over any number of threads☆76Updated 3 weeks ago
- A go utility for scraping web page metadata, supporting open graph, schema.org and more.☆13Updated 10 years ago
- Document Indexing and Searching Library in Go☆19Updated 5 years ago
- Fast integer map for uint32-to-uint32☆33Updated 6 months ago
- pure golang spelling based on hunspell dictionaries☆41Updated 9 years ago
- Increasing bleve indexing performance with sharding☆20Updated 7 years ago
- Dictionary Password Validation for Go☆51Updated 9 years ago
- SAX(Simple API for XML)-like API for golang☆23Updated 4 years ago
- Easy file permissions for golang. Easily get and set file permission bits.☆53Updated 4 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text☆73Updated last year