ChrisCates / CommonCrawlerLinks
πΈ A simple way to extract data from Common Crawl
β34Updated 5 years ago
Alternatives and similar repositories for CommonCrawler
Users that are interested in CommonCrawler are comparing it to the libraries listed below
Sorting:
- π Lens is an opt-in search engine and data collection tool to aid content discovery of the distributed webβ61Updated 5 years ago
- simple base58 codecβ20Updated 8 years ago
- Go port of secret-handshakeβ45Updated 7 months ago
- Miscellaneous tools for processing WARC files from the CommonCrawlβ24Updated 11 years ago
- NAT port mapping library for go-libp2pβ62Updated 3 years ago
- Advanced declarative web scrapingβ30Updated 2 years ago
- Simple Email Parserβ47Updated 9 years ago
- A Go implementation of the OTR 3 protocol, with libotr 4.1.0 feature parityβ74Updated 2 years ago
- textextract is a tiny library (87 lines of Go) that identifies where the article content is in a HTML page (as opposed to navigation, heaβ¦β11Updated 6 years ago
- Weighted PageRank implementation in Goβ86Updated 4 years ago
- A RiveScript interpreter for Go. RiveScript is a scripting language for chatterbots.β61Updated last year
- Streaming decoder for JSON arraysβ37Updated 10 years ago
- Datastore implementation using badger as backend.β57Updated 3 months ago
- A client and server side solution for zero knowledge authentication, in Goβ15Updated 8 years ago
- Zero knowledge push relayβ33Updated 5 years ago
- Summarizes textβ39Updated 9 years ago
- Go package for abstracting local, in-memory, and remote (Google Cloud Storage/S3) filesystemsβ52Updated 6 years ago
- Go client for newsapi (https://newsapi.org/)β37Updated 5 years ago
- Text summarizer for golang using LexRankβ132Updated last year
- Fast, decentralized and git-trackable database.β49Updated 5 years ago
- Livestreaming via IPFSβ23Updated 2 years ago
- Shamir's Secret Sharing Algorithm implementation in golang combined with PGP and a mail delivery systemβ35Updated 7 years ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.β148Updated last year
- A real-time collaborative Markdown editor and document repository with simple organization and project-based managementβ52Updated 2 years ago
- An implementation of Noise in Goβ42Updated 7 years ago
- Go language wrapper around the natty NAT-traversal utilityβ35Updated 6 years ago
- Compare various different Hashing Algorithmsβ22Updated 7 years ago
- orc - Onion router control protocol library.β39Updated 7 years ago
- A peer to peer service registry and discovery tool.β39Updated 11 years ago
- SuperHacker is the ultimate utility to make you look like a hacker.β48Updated 3 years ago