wolfgangmeyers / go-warc
A golang library to work with WARC files from the common crawl
☆15 Updated 7 years ago
Alternatives and similar repositories for go-warc
Users who are interested in go-warc are comparing it to the libraries listed below.
- Read and write WARC files in Go ☆47 Updated 7 years ago
- The speed of a native map, the safety of sync.RWMutex and the durability of bbolt ☆24 Updated 5 years ago
- golang readers for ARC and WARC webarchive formats ☆20 Updated 2 years ago
- A golang phonetics algorithm library ☆31 Updated 11 years ago
- A pure Go implementation of the smaz compression library for short strings. ☆20 Updated 10 years ago
- Latent Dirichlet Allocation ☆31 Updated 3 years ago
- Tokenizers and lemmatizers for Go ☆113 Updated 3 months ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences. ☆94 Updated last month
- DNS client & server package for Go ☆42 Updated 6 years ago
- Serve millions of JSON documents via HTTP. ☆70 Updated last year
- adding badger support to blevesearch ☆63 Updated 2 years ago
- Increasing bleve indexing performance with sharding ☆20 Updated 7 years ago
- ☆26 Updated last year
- A simple, lightweight, embedded geocoder for Golang with city level accuracy ☆73 Updated 10 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014) ☆26 Updated 8 years ago
- Decode embedded EXIF meta data from image files written in Pure Golang ☆33 Updated 2 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang) ☆38 Updated 6 years ago
- golang library to make https://chartjs.org/ plots (this is vanilla #golang, not gopherjs) ☆48 Updated 5 years ago
- Miscellaneous tools for processing WARC files from the CommonCrawl ☆24 Updated 11 years ago
- Roaring Bitmaps - compressed bitmaps in Go ☆48 Updated 11 years ago
- My sandbox for go experiments ☆27 Updated 5 years ago
- Fast integer map for uint32-to-uint32 ☆32 Updated 6 months ago
- A go utility for scraping web page metadata, supporting open graph, schema.org and more. ☆13 Updated 10 years ago
- Parser for HTML microdata, schema.org ☆34 Updated 9 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers. ☆54 Updated 9 years ago
- Go package for abstracting local, in-memory, and remote (Google Cloud Storage/S3) filesystems ☆52 Updated 7 years ago
- A fast, tested, and predictable way to clean, aggregate, and transform data ☆35 Updated 6 years ago
- 2D locality queries in Go ☆34 Updated 6 months ago
- Dictionary Password Validation for Go ☆51 Updated 9 years ago
- Amazon S3 storage interface for a Go cache ☆29 Updated 11 years ago