ChrisCates / CommonCrawlerLinks
🕸 A simple way to extract data from Common Crawl
☆34Updated 5 years ago
Alternatives and similar repositories for CommonCrawler
Users that are interested in CommonCrawler are comparing it to the libraries listed below
Sorting:
- 🔍 Lens is an opt-in search engine and data collection tool to aid content discovery of the distributed web☆61Updated 6 years ago
- Read and write WARC files in Go☆48Updated 7 years ago
- Text summarizer for golang using LexRank☆137Updated 4 months ago
- Datastore implementation using badger as backend.☆58Updated 5 months ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.☆153Updated 2 years ago
- Summarizes text☆39Updated 10 years ago
- Gonudb is an append-only key/value datastore written in Go.☆19Updated 2 years ago
- Go port of secret-handshake☆45Updated last year
- Zero knowledge push relay☆33Updated 6 years ago
- Weighted PageRank implementation in Go☆87Updated 4 years ago
- adding badger support to blevesearch☆63Updated 2 years ago
- simple base58 codec☆20Updated 8 years ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆222Updated 7 months ago
- Miscellaneous tools for processing WARC files from the CommonCrawl☆25Updated 12 years ago
- Simple Email Parser☆47Updated 9 years ago
- structured data differ with near-linear time complexity☆19Updated 4 years ago
- package lingo provides the data structures and algorithms required for natural language processing☆158Updated 2 years ago
- Nifty library to manage, query and store RDF triples. Make RDF great again!☆116Updated 6 years ago
- HTTP on top of libp2p☆67Updated 5 months ago
- The Bloom Tree☆31Updated 5 years ago
- An example app providing an HTTP/REST/JSON front-end to bleve☆134Updated 11 months ago
- User-space deniable data encryption client.☆94Updated 2 years ago
- A RiveScript interpreter for Go. RiveScript is a scripting language for chatterbots.☆62Updated 2 years ago
- PageRank implementation in Go☆102Updated 2 years ago
- goprocess - like Context, but with good close semantics.☆73Updated 5 years ago
- Go implementation of haadcode's orbit-db☆11Updated 9 years ago
- A list of media, code and info about Dgraph.☆23Updated 5 years ago
- key-value datastore interfaces☆245Updated last month
- an object to manage sets of peers, their addresses and other metadata☆89Updated 3 years ago
- High-level Database Abstraction Layer for Go☆60Updated 2 years ago