JalfResi / justextLinks
A Go package that implements the JusText boilerplate removal algorithm
☆110Updated 3 years ago
Alternatives and similar repositories for justext
Users that are interested in justext are comparing it to the libraries listed below
Sorting:
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆88Updated 2 years ago
- Ngram index for golang☆114Updated 9 years ago
- Read and use word2vec vectors in Go☆57Updated 7 years ago
- package lingo provides the data structures and algorithms required for natural language processing☆158Updated 2 years ago
- simhash storage and searching☆138Updated 8 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 6 years ago
- Middleware for keeping track of users, login states and permissions☆88Updated 2 months ago
- Pure-Go full text indexer and search library☆94Updated 10 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers.☆53Updated 8 years ago
- High Performance Porter2 Stemmer☆47Updated 5 years ago
- An approximate string matching library for the Go programming language.☆181Updated 3 years ago
- Natural Language Processing Toolkit in Golang☆64Updated 5 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text☆72Updated 11 months ago
- Minimal Perfect Hashing for Go☆191Updated last year
- Stream bytes to multiple independent Readers #golang☆96Updated last year
- Utilities for working with discrete probability distributions and other tools useful for doing NLP work☆95Updated 13 years ago
- Bayesian text classifier with flexible tokenizers and storage backends for Go☆158Updated 5 years ago
- example bleve application for indexing and search beers and breweries☆91Updated 8 months ago
- Embeds static resources into go files for single binary compilation + works with http.FileSystem + symlinks☆67Updated 9 years ago
- ☆50Updated 4 years ago
- Word Stemming in Go☆82Updated 7 years ago
- doc2vec , word2vec, implemented by golang. word embedding representation☆41Updated 7 years ago
- Very fast, very unsafe serialization for Go☆147Updated 3 years ago
- I'm trying to learn how to use ragel in Go libraries. As I'm implementing things for practice I'll add them here. I'll be using Go 1.1, t…☆65Updated 12 years ago
- schema is a Go package providing access to database schema metadata, for database/sql drivers.☆56Updated last year
- A pure Go implementation of the smaz compression library for short strings.☆82Updated 3 years ago
- Genex package for Go☆76Updated 5 years ago
- Go library for performing computations in word2vec binary models☆203Updated 3 years ago
- A lemmatizer implemented in Go☆91Updated 6 months ago
- sqlite3 binding for go☆61Updated 2 years ago