crawlerclub / ce
HTML article content extractor in Golang.
☆12Updated 3 years ago
Alternatives and similar repositories for ce
Users that are interested in ce are comparing it to the libraries listed below
- Unsupervised Word Discovery☆10Updated 6 years ago
- Go-flashtext is an implementation of the FlashText algorithm written in Go (Golang).☆19Updated 4 years ago
- Read and use word2vec vectors in Go☆57Updated 7 years ago
- gRPC server for hnswlib☆16Updated 2 years ago
- Natural Language Processing Toolkit in Golang☆64Updated 5 years ago
- Go implementation of today's most used tokenizers☆44Updated 4 years ago
- doc2vec and word2vec implemented in Golang; word embedding representation☆41Updated 7 years ago
- Functional Meaning Representation and Semantic Parsing Framework☆78Updated 3 years ago
- Facebook fastText database in SQLite with Go API☆35Updated 5 years ago
- A general purpose application which can be used to host read-only access to one or more Bleve indexes☆12Updated 9 years ago
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc…☆25Updated 2 years ago
- A declarative, SQL-like DSL for data integration tasks.☆14Updated 7 years ago
- Bleve Extensions☆46Updated last year
- A streaming ETL for fish☆13Updated 6 years ago
- An implementation of the Goose HTML Content / Article Extractor algorithm in golang☆40Updated 4 years ago
- Event matching for log records☆11Updated 11 years ago
- Distributed Approximate Nearest Neighbors Database https://anndb.com☆37Updated 2 months ago
- Inference Llama 2 in Go☆39Updated 2 years ago
- A simple library for loading word2vec binary model.☆12Updated 10 years ago
- Binary heap priority queues in Go☆32Updated 4 years ago
- Go Based Lightweight RAG / LLM Tool with CLI + API☆14Updated 2 years ago
- Latent Dirichlet Allocation☆31Updated 3 years ago
- A Go package that implements the JusText boilerplate removal algorithm☆110Updated 2 years ago
- An Inverted Index generator implemented in Go used for text search in large document sets.☆18Updated 5 years ago
- Go client for txtai☆80Updated 2 months ago
- Best way to use ChatGPT/GPT-3 with Go: zero dependencies, tokenizer, under 1500 LOC☆14Updated last year
- tfidf provides TF-IDF functionality☆13Updated 2 years ago
- bm25 is a scoring function that helps with information retrieval☆14Updated 5 years ago
- Tagify produces a set of tags from a given source. Source can be either an HTML page, a Markdown document or a plain text. Supports Engli…☆41Updated last year
- shoco is a compressor for small text strings. [Not maintained].☆10Updated 6 years ago