wolfgangmeyers / go-warcLinks
A golang library to work with WARC files from the common crawl
☆15Updated 7 years ago
Alternatives and similar repositories for go-warc
Users that are interested in go-warc are comparing it to the libraries listed below
Sorting:
- Read and write WARC files in Go☆47Updated 7 years ago
- golang readers for ARC and WARC webarchive formats☆20Updated 2 years ago
- The speed of a native map, the safety of sync.RWMutex and the durability of bbolt☆24Updated 5 years ago
- Serve millions of JSON documents via HTTP.☆70Updated last year
- adding badger support to blevesearch☆63Updated 2 years ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy☆73Updated 10 years ago
- Tokenizers and lemmatizers for Go☆113Updated 2 months ago
- Increasing bleve indexing performance with sharding☆20Updated 7 years ago
- A pure Go implementation of the smaz compression library for short strings.☆20Updated 9 years ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.☆152Updated 2 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 6 years ago
- DNS client & server package for Go☆42Updated 6 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Updated 7 years ago
- Latent Dirichlet Allocation☆31Updated 3 years ago
- Fast integer map for uint32-to-uint32☆32Updated 5 months ago
- web-based UI editor for bleve index mappings☆23Updated 7 months ago
- Text summarizer for golang using LexRank☆137Updated last month
- a tiny package that implements SMTP server for Go projects☆106Updated last year
- This package helps to work with huge amount of data, which cannot be stored in RAM☆43Updated 3 years ago
- Go package for abstracting local, in-memory, and remote (Google Cloud Storage/S3) filesystems☆52Updated 7 years ago
- dirscanner is a recursive file lister which uses channels for go.☆25Updated 6 years ago
- Simple Go library for executing lots of operations spread over any number of threads☆75Updated 2 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers.☆53Updated 8 years ago
- Decode embedded EXIF meta data from image files written in Pure Golang☆33Updated 2 years ago
- Miscellaneous tools for processing WARC files from the CommonCrawl☆24Updated 11 years ago
- Command deadleaves finds and prints the import paths of unused Go packages.☆34Updated 9 years ago
- A go utility for scraping web page metadata, supporting open graph, schema.org and more.☆13Updated 10 years ago
- An IP lookup system utilizing open datasets☆61Updated 3 years ago
- golang library for smtp based email validation☆54Updated 4 years ago
- Dictionary Password Validation for Go☆49Updated 9 years ago