crawlerclub / ceLinks
Html article content extractor in Golang.
☆12Updated 3 years ago
Alternatives and similar repositories for ce
Users that are interested in ce are comparing it to the libraries listed below
Sorting:
- Unsupervised Word Discovery☆10Updated 6 years ago
- Go-flashtext is a flashtext implement written in Go (Golang). It is based on the FlashText algorithm.☆19Updated 4 years ago
- gRPC server for hnswlib☆16Updated 2 years ago
- Natural Language Processing Toolkit in Golang☆64Updated 5 years ago
- Read and use word2vec vectors in Go☆57Updated 7 years ago
- Go implementation of today's most used tokenizers☆44Updated 4 years ago
- doc2vec , word2vec, implemented by golang. word embedding representation☆41Updated 7 years ago
- A declarative, SQL-like DSL for data integration tasks.☆14Updated 7 years ago
- Orchestration engine & UI for your customized LLM flow.☆21Updated last year
- A streaming ETL for fish☆13Updated 6 years ago
- A Go package that implements the JusText boilerplate removal algorithm☆110Updated 3 years ago
- Bleve Extensions☆46Updated last year
- xgboost go wrapper for c_api☆22Updated 7 years ago
- A simple library for loading word2vec binary model.☆12Updated 10 years ago
- Facebook fastText database in SQLite with Go API☆35Updated 5 years ago
- A general purpose application which can be used to host read-only access to one or more Bleve indexes☆12Updated 9 years ago
- bm25 is a scoring function that helps with information retrieval☆14Updated 5 years ago
- A standalone lightweight full-text search engine built on top of blevesearch and Go with multiple storage (scorch, boltdb, leveldb, badge…☆159Updated 6 years ago
- Event matching for log records☆11Updated 11 years ago
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc…☆25Updated 2 years ago
- Binary heap priority queues in Go☆32Updated 4 years ago
- Inference Llama 2 in Go☆39Updated 2 years ago
- shoco is a compressor for small text strings. [Not maintained].☆10Updated 6 years ago
- tfidf provides TF-IDF functionality☆13Updated 2 years ago
- ZenModel is a framework for building LLM applications with agentic workflow☆74Updated last year
- Type-safe, automatic, asynchronous batch processing.☆22Updated last year
- News Content / Article Extractor written in Go☆32Updated 10 years ago
- An implementation of the Goose HTML Content / Article Extractor algorithm in golang☆40Updated 4 years ago
- Golang RESTful Client for HanLP☆13Updated last year
- A lightweight job scheduler based on priority queue with timeout, retry, replica, context cancellation and easy semantics for job chainin…☆63Updated 5 years ago