crawlerclub / ce
HTML article content extractor in Golang.
☆12Updated 2 years ago
Alternatives and similar repositories for ce
Users who are interested in ce are comparing it to the libraries listed below.
- Unsupervised Word Discovery☆10Updated 5 years ago
- Go-flashtext is an implementation of the FlashText algorithm written in Go (Golang).☆19Updated 4 years ago
- Natural Language Processing Toolkit in Golang☆64Updated 5 years ago
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- doc2vec and word2vec implemented in Golang; word embedding representation☆41Updated 7 years ago
- gRPC server for hnswlib☆14Updated 2 years ago
- Go implementation of today's most used tokenizers☆42Updated 4 years ago
- A streaming ETL for fish☆13Updated 6 years ago
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc…☆24Updated 2 years ago
- Go Based Lightweight RAG / LLM Tool with CLI + API☆14Updated last year
- A declarative, SQL-like DSL for data integration tasks.☆14Updated 6 years ago
- Inference Llama 2 in Go☆39Updated last year
- bm25 is a scoring function that helps with information retrieval☆14Updated 4 years ago
- General-purpose, fast, stateless, and deterministic feature extractor written in Golang for use in machine learning☆12Updated 7 years ago
- Go data structures☆14Updated 5 years ago
- Functional Meaning Representation and Semantic Parsing Framework☆78Updated 2 years ago
- Binary heap priority queues in Go☆31Updated 4 years ago
- XGBoost Go wrapper for c_api☆22Updated 7 years ago
- A general purpose application which can be used to host read-only access to one or more Bleve indexes☆13Updated 8 years ago
- A tool to find all duplicates in large sets of text documents.☆16Updated 3 years ago
- An Inverted Index generator implemented in Go used for text search in large document sets.☆18Updated 5 years ago
- Go library for accessing the Paddle API☆10Updated 3 years ago
- ☆15Updated 4 years ago
- Extract content from HTML by removing unwanted boilerplate text.☆9Updated 7 years ago
- ⏱ Benchmarks of machine learning inference for Go☆31Updated last year
- Unofficial C binding for Onnxruntime in Golang.☆18Updated 3 months ago
- Facebook fastText database in SQLite with Go API☆35Updated 4 years ago
- Best way to use ChatGPT/GPT-3 with Go: zero dependencies, tokenizer, under 1500 LOC☆13Updated 9 months ago
- Bleve Extensions☆48Updated last year
- ZenModel is a framework for building LLM applications with agentic workflow☆67Updated 6 months ago