crawlerclub / ce
HTML article content extractor in Golang.
☆12 Updated 2 years ago
Alternatives and similar repositories for ce
Users who are interested in ce are comparing it to the libraries listed below.
- Unsupervised Word Discovery ☆10 Updated 5 years ago
- Read and use word2vec vectors in Go ☆56 Updated 6 years ago
- Go-flashtext is a flashtext implementation written in Go (Golang). It is based on the FlashText algorithm. ☆19 Updated 4 years ago
- Go implementation of today's most used tokenizers ☆43 Updated 4 years ago
- Natural Language Processing Toolkit in Golang ☆64 Updated 5 years ago
- gRPC server for hnswlib ☆14 Updated 2 years ago
- An Inverted Index generator implemented in Go used for text search in large document sets. ☆18 Updated 5 years ago
- doc2vec and word2vec word embedding representations, implemented in Golang ☆41 Updated 7 years ago
- A streaming ETL for fish ☆13 Updated 6 years ago
- tfidf provides TF-IDF functionality ☆12 Updated last year
- Bleve Extensions ☆48 Updated last year
- Go Based Lightweight RAG / LLM Tool with CLI + API ☆15 Updated last year
- Package assocentity returns the mean distance from tokens to an entity and its synonyms ☆17 Updated this week
- String similarity functions and string distances: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subse… ☆61 Updated 7 years ago
- Crawler4U, a general purpose focused crawler ☆38 Updated 4 years ago
- A declarative, SQL-like DSL for data integration tasks. ☆14 Updated 6 years ago
- Genetic Algorithm and Particle Swarm Optimization ☆33 Updated 3 years ago
- Facebook fastText database in SQLite with Go API ☆35 Updated 4 years ago
- A Go package that implements the JusText boilerplate removal algorithm ☆109 Updated 2 years ago
- Concurrent map implementation using a bucket list, like a skip list. ☆10 Updated 3 years ago
- Sparse Levenshtein automaton in Go ☆24 Updated 5 years ago
- Tagify produces a set of tags from a given source. The source can be an HTML page, a Markdown document, or plain text. Supports Engli… ☆39 Updated 11 months ago
- Latent Dirichlet Allocation ☆30 Updated 3 years ago
- Text classifier for Go, aka document categorization. ☆41 Updated 9 years ago
- A general purpose application which can be used to host read-only access to one or more Bleve indexes ☆13 Updated 8 years ago
- High-level performant CSV encoding and decoding library ☆18 Updated 8 months ago
- Convenience packages for data science in Go. ☆30 Updated 3 months ago
- Go client for txtai ☆79 Updated 2 weeks ago
- bm25 is a scoring function that helps with information retrieval ☆14 Updated 4 years ago
- Package hyperscan provides Go bindings for 01org/hyperscan high-performance regular expression matching library. ☆21 Updated 6 years ago