JalfResi / justext
A Go package that implements the JusText boilerplate removal algorithm
☆108Updated 2 years ago
Alternatives and similar repositories for justext:
Users that are interested in justext are comparing it to the libraries listed below
- Read and use word2vec vectors in Go☆55Updated 6 years ago
- Middleware for keeping track of users, login states and permissions☆89Updated last year
- High Performance Porter2 Stemmer☆45Updated 4 years ago
- Stream bytes to multiple independent Readers #golang☆91Updated last year
- I'm trying to learn how to use ragel in Go libraries. As I'm implementing things for practice I'll add them here. I'll be using Go 1.1, t…☆64Updated 11 years ago
- A small library to help write parsers for Domain Specific Languages using pure go code.☆40Updated last week
- Ngram index for golang☆114Updated 8 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Updated 2 years ago
- Embeddable in memory key/value store for strings in golang☆53Updated 6 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 5 years ago
- Unicode transliterator for #golang☆80Updated 9 years ago
- An XPath 1.0 implementation written in the Go programming language.☆148Updated 3 years ago
- GoLang Library for Browser Capabilities Project☆49Updated last year
- An approximate string matching library for the Go programming language.☆177Updated 2 years ago
- Extensible arithmetic parsing lib for go☆67Updated 5 years ago
- doc2vec , word2vec, implemented by golang. word embedding representation☆41Updated 6 years ago
- Image resizing in pure Go and SIMD☆213Updated 7 years ago
- Embeds static resources into go files for single binary compilation + works with http.FileSystem + symlinks☆67Updated 8 years ago
- Go HTTP gzip compression package☆14Updated 3 years ago
- Decentralized, sequential, lexicographically sortable unique id☆83Updated 4 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers.☆53Updated 8 years ago
- Very fast, very unsafe serialization for Go☆146Updated 2 years ago
- Fast generation of 192-bit UUIDs☆84Updated last year
- An implementation of the Goose HTML Content / Article Extractor algorithm in golang☆40Updated 3 years ago
- Package mafsa implements Minimal Acyclic Finite State Automata in Go, essentially a high-speed, memory-efficient, Unicode-friendly set of…☆295Updated 5 years ago
- Period provides a set of missing Time Range to Go, it cover all basic operations regardings time ranges.☆46Updated 9 years ago
- The slice package sorts Go slices.☆111Updated 6 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆25Updated 7 years ago
- Calculate text distance (similarity) in Golang - Experimental implementation☆93Updated 4 years ago
- A Go implementation of the readability algorithm by arc90 labs☆132Updated 2 years ago