PaperCutSoftware / pdfsearch
A full text search library for PDFs.
☆67 Updated 5 years ago
Alternatives and similar repositories for pdfsearch
Users who are interested in pdfsearch are comparing it to the libraries listed below.
- A Go package that implements the JusText boilerplate removal algorithm ☆110 Updated 3 years ago
- Text summarizer for golang using LexRank ☆137 Updated 4 months ago
- package lingo provides the data structures and algorithms required for natural language processing ☆158 Updated 2 years ago
- Go client for txtai ☆81 Updated 2 weeks ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics. ☆153 Updated 2 years ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences. ☆98 Updated last week
- 🔮 Graph Layout Algorithms in Go ☆95 Updated 9 months ago
- A Go library that uses pdfium (via cgo) to render pdfs to images ☆42 Updated 7 years ago
- A fast, tested, and predictable way to clean, aggregate, and transform data ☆35 Updated 6 years ago
- PipeIt is a text transformation, conversion, cleansing and extraction tool. ☆80 Updated 4 years ago
- Programmatic document generation as an HTTP service. Render PDFs using LaTeX templates and JSON. ☆221 Updated 11 months ago
- Data table structure in Go, now developed at https://github.com/cogentcore/core/tree/main/tensor ☆117 Updated last year
- Turn asterisk-indented text lines into mind maps ☆108 Updated 5 years ago
- ☆18 Updated 4 years ago
- A real-time collaborative Markdown editor and document repository with simple organization and project-based management ☆57 Updated 5 months ago
- Self-organizing maps in Go ☆74 Updated 3 years ago
- Natural Language Processing Toolkit in Golang ☆64 Updated 5 years ago
- Fake English word generator for Go and CLI ☆44 Updated 4 years ago
- A small wrapper around the parser and ast packages ☆23 Updated last year
- A PDF renderer for the goldmark markdown parser. ☆142 Updated last month
- Pratt parser implementation in Go ☆46 Updated 3 years ago
- A simple tool to collect and process quite a few web news from multiple sources ☆36 Updated 3 years ago
- A tiny event broker ☆30 Updated last year
- Go implementation of today's most used tokenizers ☆44 Updated 5 years ago
- distributed data sync with operational transformation/transforms ☆87 Updated 6 years ago
- An experimental PDF reader for go ☆57 Updated 13 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text ☆73 Updated last year
- Go code to help create various charts, e.g. C3, D3, Rickshaw, go-chart, etc. ☆54 Updated last week
- tfidf provides TF-IDF functionality ☆13 Updated 2 years ago
- SQLite FTS5-based search engine for Hugo pages ☆38 Updated 7 months ago