markusmobius / go-trafilatura
go-trafilatura is a Go port of the trafilatura Python library.
☆52 Updated 2 months ago
Alternatives and similar repositories for go-trafilatura:
Users who are interested in go-trafilatura are comparing it to the libraries listed below.
- A lemmatizer implemented in Go ☆86 Updated 2 years ago
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen… ☆66 Updated 4 months ago
- Go client for txtai ☆74 Updated 3 weeks ago
- Go bindings for HuggingFace Tokenizer ☆102 Updated 2 months ago
- Go module for fetching embeddings from embeddings providers ☆43 Updated last month
- Go parser for human-readable dates, ported from the dateparser Python package ☆53 Updated 6 months ago
- NLP tokenizers written in Go language ☆206 Updated last month
- Support for reading and writing PDF files in Go. ☆31 Updated this week
- SQLite FTS5-based search engine for Hugo pages ☆35 Updated 2 weeks ago
- Simple Go package to convert HTML to plain text ☆145 Updated last year
- VADER sentiment analysis in Go ☆45 Updated 2 years ago
- Unofficial (Golang) Go bindings for the Hugging Face Inference API ☆62 Updated last month
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle… ☆105 Updated last year
- A Go package that implements the JusText boilerplate removal algorithm ☆108 Updated 2 years ago
- Go implementation of the SentencePiece tokenizer ☆24 Updated 4 months ago
- Go implementation of @qdrant/fastembed. ☆55 Updated 8 months ago
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper… ☆119 Updated last year
- A highly effective Golang library for parsing large sitemaps while avoiding high memory usage. The sitemap parser was written in Golang w… ☆36 Updated last year
- A Go implementation of the Thumbhash image placeholder generation algorithm. ☆78 Updated 6 months ago
- Natural Language Processing Toolkit in Golang ☆63 Updated 4 years ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine). ☆207 Updated 3 years ago
- Article spinning and spintax/spinning syntax engine written in Go, useful for A/B testing pieces of text/articles and creating more natu… ☆59 Updated 3 years ago
- Go Bindings for BERT NLP Models ☆101 Updated 5 years ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics. ☆147 Updated last year
- Go implementation of today's most used tokenizers ☆41 Updated 4 years ago
- Natural language detection package in pure Go ☆172 Updated 4 years ago
- Headless browser for Go for TDD workflows ☆78 Updated this week
- A Go port of the Rapid Automatic Keyword Extraction algorithm (RAKE) ☆118 Updated 3 weeks ago
- Golinkedin is a library written in pure Go for scraping LinkedIn ☆41 Updated 9 months ago
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library. ☆107 Updated 2 years ago