JalfResi / justext
A Go package that implements the JusText boilerplate removal algorithm
☆110 Updated 3 years ago
Alternatives and similar repositories for justext
Users who are interested in justext are comparing it to the libraries listed below.
- Ngram index for golang ☆114 Updated 9 years ago
- package lingo provides the data structures and algorithms required for natural language processing ☆158 Updated 2 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 ☆88 Updated 3 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang) ☆38 Updated 6 years ago
- Natural Language Processing Toolkit in Golang ☆64 Updated 5 years ago
- An approximate string matching library for the Go programming language. ☆182 Updated 3 years ago
- I'm trying to learn how to use ragel in Go libraries. As I'm implementing things for practice I'll add them here. I'll be using Go 1.1, t… ☆65 Updated 12 years ago
- mediawiki dump parser for loading up wikipedia data ☆108 Updated last month
- Middleware for keeping track of users, login states and permissions ☆88 Updated 3 weeks ago
- A Go implementation of the readability algorithm by arc90 labs ☆135 Updated 3 years ago
- simhash storage and searching ☆138 Updated 8 years ago
- Package mafsa implements Minimal Acyclic Finite State Automata in Go, essentially a high-speed, memory-efficient, Unicode-friendly set of… ☆295 Updated 6 years ago
- High Performance Porter2 Stemmer ☆47 Updated 5 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text ☆73 Updated last year
- Read and use word2vec vectors in Go ☆58 Updated 7 years ago
- Bayesian text classifier with flexible tokenizers and storage backends for Go ☆158 Updated 5 years ago
- Go bindings for FANN, library for artificial neural networks ☆117 Updated 10 years ago
- Self-organizing maps in Go ☆74 Updated 3 years ago
- A lemmatizer implemented in Go ☆91 Updated 7 months ago
- Stream bytes to multiple independent Readers #golang ☆96 Updated 2 years ago
- adding badger support to blevesearch ☆63 Updated 2 years ago
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library. ☆114 Updated 3 years ago
- doc2vec, word2vec, implemented by golang. word embedding representation ☆41 Updated 7 years ago
- Multiclass Naive Bayesian Classification ☆75 Updated 7 years ago
- Real-time Go struct to JS object synchronisation over SSE and WebSockets ☆186 Updated 7 months ago
- sparse levenshtein automaton in go ☆24 Updated 5 years ago
- Word Stemming in Go ☆82 Updated 7 years ago
- Minimal Perfect Hashing for Go ☆191 Updated last year
- Embeds static resources into go files for single binary compilation + works with http.FileSystem + symlinks ☆67 Updated 9 years ago
- example bleve application for indexing and search beers and breweries ☆91 Updated 9 months ago