PaperCutSoftware / pdfsearchLinks
A full text search library for PDFs.
☆66Updated 4 years ago
Alternatives and similar repositories for pdfsearch
Users that are interested in pdfsearch are comparing it to the libraries listed below
Sorting:
- A Go package that implements the JusText boilerplate removal algorithm☆109Updated 2 years ago
- A fast, tested, and predictable way to clean, aggregate, and transform data☆35Updated 5 years ago
- Text summarizer for golang using LexRank☆132Updated last year
- In memory cache server with query capabilities☆26Updated last year
- package lingo provides the data structures and algorithms required for natural language processing☆156Updated 2 years ago
- Go client for txtai☆79Updated last month
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- Go implementation of today's most used tokenizers☆44Updated 4 years ago
- An experimental PDF reader for go☆58Updated 12 years ago
- Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.☆148Updated last year
- ☆44Updated 5 years ago
- A simple tool to collect and process quite a few web news from multiple sources☆34Updated 3 years ago
- sqlite3 binding for go☆61Updated 2 years ago
- Fake English word generator for Go and CLI☆44Updated 4 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text☆73Updated 7 months ago
- A Go implementation of the WordNet API☆39Updated 6 years ago
- Search any text-based document☆23Updated 4 years ago
- Generative Adversarial Network in Go via Gorgonia☆87Updated 3 years ago
- ☆18Updated 4 years ago
- Self-organizing maps in Go☆74Updated 3 years ago
- Natural Language Processing Toolkit in Golang☆64Updated 5 years ago
- Serve millions of JSON documents via HTTP.☆70Updated 8 months ago
- An easy-to-use, lightweight embedded on-disk database built on Badger for use in your Go programs.☆52Updated 4 years ago
- Document Indexing and Searching Library in Go☆19Updated 5 years ago
- tfidf provides TF-IDF functionality☆12Updated last year
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Updated 2 years ago
- PipeIt is a text transformation, conversion, cleansing and extraction tool.☆80Updated 3 years ago
- Flowgraph package for scalable asynchronous system development☆63Updated 4 years ago
- A set of tools for working with JSON, CSV and Excel workbooks☆78Updated 2 months ago
- An Inverted Index generator implemented in Go used for text search in large document sets.☆18Updated 5 years ago