PaperCutSoftware / pdfsearch
A full text search library for PDFs.
☆67 Updated 5 years ago
Alternatives and similar repositories for pdfsearch
Users who are interested in pdfsearch are comparing it to the libraries listed below.
- Go client for txtai ☆79 Updated last month
- A Go package that implements the JusText boilerplate removal algorithm ☆110 Updated 2 years ago
- package lingo provides the data structures and algorithms required for natural language processing ☆156 Updated 2 years ago
- ☆18 Updated 4 years ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences. ☆87 Updated this week
- SQLite FTS5-based search engine for Hugo pages ☆36 Updated 4 months ago
- Production grade LLM-ops in Golang ☆57 Updated last week
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics. ☆151 Updated 2 years ago
- Pratt parser implementation in Go ☆46 Updated 3 years ago
- A simple tool to collect and process quite a few web news from multiple sources ☆35 Updated 3 years ago
- A fast, tested, and predictable way to clean, aggregate, and transform data ☆35 Updated 6 years ago
- Text summarizer for golang using LexRank ☆134 Updated 2 weeks ago
- Go module for fetching embeddings from embeddings providers ☆54 Updated 3 months ago
- Search any text-based document ☆23 Updated 5 years ago
- A small wrapper around the parser and ast packages ☆24 Updated last year
- 🔮 Graph Layout Algorithms in Go ☆94 Updated 6 months ago
- Convenience packages for data science in Go. ☆31 Updated last week
- Go implementation of today's most used tokenizers ☆44 Updated 4 years ago
- sqlite3 binding for go ☆61 Updated 2 years ago
- Document Indexing and Searching Library in Go ☆19 Updated 5 years ago
- tfidf provides TF-IDF functionality ☆12 Updated last year
- A Go library that uses pdfium (via cgo) to render pdfs to images ☆42 Updated 6 years ago
- Tagify produces a set of tags from a given source. Source can be either an HTML page, a Markdown document or a plain text. Supports Engli… ☆39 Updated last year
- PDF rendering library for Go using TeX algorithms. ☆242 Updated 5 months ago
- Gocal is a simple clone of pcal. It's a tool to create monthly calendars in PDF with a few gimmicks. ☆27 Updated 9 months ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text ☆72 Updated 10 months ago
- Dataframe library for Go. ☆16 Updated 2 years ago
- XML stream parser for Go ☆110 Updated last year
- Make any Go function into a API (FaaS) ☆116 Updated 11 months ago
- Generate software design diagrams in Go ☆41 Updated last year