PaperCutSoftware / pdfsearch
A full text search library for PDFs.
☆67 Updated 5 years ago
Alternatives and similar repositories for pdfsearch
Users who are interested in pdfsearch are comparing it to the libraries listed below.
- A Go package that implements the JusText boilerplate removal algorithm ☆110 Updated 3 years ago
- package lingo provides the data structures and algorithms required for natural language processing ☆158 Updated 2 years ago
- Go client for txtai ☆80 Updated 2 weeks ago
- A fast, tested, and predictable way to clean, aggregate, and transform data ☆35 Updated 6 years ago
- An Inverted Index generator implemented in Go used for text search in large document sets. ☆18 Updated 6 years ago
- Tagify produces a set of tags from a given source. Source can be either an HTML page, a Markdown document or a plain text. Supports Engli… ☆39 Updated last year
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics. ☆152 Updated 2 years ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences. ☆96 Updated last month
- Text summarizer for golang using LexRank ☆137 Updated 3 months ago
- ☆18 Updated 4 years ago
- Go implementation of today's most used tokenizers ☆44 Updated 5 years ago
- Read and use word2vec vectors in Go ☆58 Updated 7 years ago
- A small wrapper around the parser and ast packages ☆23 Updated last year
- A Go library that uses pdfium (via cgo) to render pdfs to images ☆42 Updated 6 years ago
- sqlite3 binding for go ☆61 Updated 2 years ago
- example bleve application for indexing and search beers and breweries ☆91 Updated 10 months ago
- Natural Language Processing Toolkit in Golang ☆64 Updated 5 years ago
- An experimental PDF reader for go ☆57 Updated 13 years ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine). ☆221 Updated 6 months ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 ☆88 Updated 3 years ago
- LaTeX to PDF print µService in Go ☆20 Updated 2 years ago
- Convenience packages for data science in Go. ☆32 Updated last week
- Self-organizing maps in Go ☆74 Updated 3 years ago
- Search any text-based document ☆23 Updated 5 years ago
- Gocal is a simple clone of pcal. It's a tool to create monthly calendars in PDF with a few gimmicks. ☆27 Updated last year
- A package to allow one to concurrently go through a filesystem with ease ☆103 Updated 4 years ago
- FillPDF - Fill PDF forms ☆84 Updated 2 years ago
- Package for creating interpreters ☆29 Updated 7 years ago
- [DEPRECATED] Pure Go implementation of Potrace vectorization library ☆55 Updated 2 years ago
- Fake English word generator for Go and CLI ☆44 Updated 4 years ago