ChrisCates / CommonCrawler
🕸 A simple way to extract data from Common Crawl
☆34 Updated 5 years ago
Alternatives and similar repositories for CommonCrawler
Users who are interested in CommonCrawler are comparing it to the libraries listed below.
- Livestreaming via IPFS ☆23 Updated last year
- Miscellaneous tools for processing WARC files from the CommonCrawl ☆24 Updated 11 years ago
- Go port of secret-handshake ☆45 Updated 5 months ago
- simple base58 codec ☆20 Updated 7 years ago
- Go implementation of haadcode's orbit-db ☆11 Updated 8 years ago
- Start Go command line apps with ease ☆16 Updated 4 months ago
- Go language wrapper around the natty NAT-traversal utility ☆35 Updated 6 years ago
- Datastore implementation using badger as backend. ☆57 Updated 2 months ago
- Golang WARC (Web ARChive) Library ☆30 Updated 5 years ago
- web-based UI editor for bleve index mappings ☆23 Updated last month
- Read and write WARC files in Go ☆45 Updated 7 years ago
- Websocket implementation for fasthttp. ☆53 Updated last year
- Read and use word2vec vectors in Go ☆56 Updated 6 years ago
- Go Stanford NLP POS Tagger wrapper ☆39 Updated 8 years ago
- 🔍 Lens is an opt-in search engine and data collection tool to aid content discovery of the distributed web ☆61 Updated 5 years ago
- Simple Go library for executing lots of operations spread over any number of threads ☆74 Updated 2 years ago
- Go client for newsapi (https://newsapi.org/) ☆37 Updated 5 years ago
- A real-time collaborative Markdown editor and document repository with simple organization and project-based management ☆52 Updated 2 years ago
- Structured scraper for Go ☆25 Updated 7 years ago
- Take screenshot of a web page ☆21 Updated 7 years ago
- adding badger support to blevesearch ☆62 Updated 2 years ago
- Summarizes text ☆38 Updated 9 years ago
- doc2vec, word2vec, implemented by golang. word embedding representation ☆41 Updated 7 years ago
- Document Indexing and Searching Library in Go ☆19 Updated 5 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text ☆73 Updated 6 months ago
- Go CAPTCHA ☆11 Updated last year
- 📟 Tiny utility Go client for HackerNews API. ☆17 Updated 7 years ago
- distributed data sync with operational transformation/transforms ☆87 Updated 5 years ago
- A client and server side solution for zero knowledge authentication, in Go ☆15 Updated 8 years ago
- Golang implementation of the Paice/Husk Stemming Algorithm ☆29 Updated 11 years ago