pebbe / textcat
A Go package for n-gram based text categorization, with support for UTF-8 and raw text
☆72, updated 3 years ago
Related projects:
- GNU Aspell spell checking library bindings for Go (golang) (☆47, updated 4 years ago)
- High Performance Porter2 Stemmer (☆46, updated 3 years ago)
- Read and use word2vec vectors in Go (☆56, updated 6 years ago)
- A package for Go that can be used for range queries on large number of intervals (☆42, updated 7 years ago)
- Fast identification of character sequences in text or documents (multi-lingual) (☆18, updated 8 years ago)
- All-in-one text tokenizer for Go. Super-fast. Lots of features. (☆13, updated 8 years ago)
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers. (☆52, updated 7 years ago)
- shoco is a compressor for small text strings. (☆11, updated 5 years ago)
- Word Stemming in Go (☆78, updated 6 years ago)
- libsvm go version (☆73, updated 8 years ago)
- Counters over sliding windows (☆19, updated 8 years ago)
- One-pass running statistics (☆51, updated 2 years ago)
- Utilities for working with discrete probability distributions and other tools useful for doing NLP work (☆97, updated 12 years ago)
- Super-efficient, in-memory key/value data store (☆21, updated 7 years ago)
- Various parsing utilities, such as IP, time, and top-level-domain, in Go (☆24, updated 8 years ago)
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014) (☆25, updated 6 years ago)
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 (☆88, updated last year)
- Split (rows and columns), sort, and search (☆55, updated last year)
- A pure Go implementation of the smaz compression library for short strings. (☆20, updated 8 years ago)
- A multicore csv reader library in Go (☆47, updated 5 months ago)
- (☆56, updated this week)
- Pattern recognition package in Go lang. (☆65, updated 11 years ago)
- Probability distributions and associated methods in Go (☆38, updated 9 years ago)
- Summarizes text (☆38, updated 9 years ago)
- Go difference algorithm (☆58, updated 10 years ago)
- Package pace provides a threadsafe counter for measuring ticks in the specified timeframe. (☆9, updated last year)
- A Go package that implements the JusText boilerplate removal algorithm (☆102, updated last year)
- Counter Data structure for Golang using CountMin Sketch with a fixed amount of memory (☆44, updated 6 years ago)
- I'm trying to learn how to use ragel in Go libraries. As I'm implementing things for practice I'll add them here. I'll be using Go 1.1, t… (☆64, updated 11 years ago)
- An implementation of the Goose HTML Content / Article Extractor algorithm in golang (☆40, updated 3 years ago)