pebbe / textcat
A Go package for n-gram based text categorization, with support for utf-8 and raw text
☆72Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for textcat
- GNU Aspell spell checking library bindings for Go (golang)☆47Updated 4 years ago
- High Performance Porter2 Stemmer☆46Updated 4 years ago
- Split (rows and columns), sort, and search☆55Updated last year
- Probability distributions and associated methods in Go☆40Updated 9 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features.☆13Updated 8 years ago
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- libsvm go version☆73Updated 8 years ago
- A Go package that implements the JusText boilerplate removal algorithm☆102Updated 2 years ago
- A package for Go that can be used for range queries on large number of intervals☆42Updated 7 years ago
- I'm trying to learn how to use ragel in Go libraries. As I'm implementing things for practice I'll add them here. I'll be using Go 1.1, t…☆64Updated 11 years ago
- Fast identification of character sequences in text or documents (multi-lingual)☆18Updated 8 years ago
- Counters over sliding windows☆19Updated 8 years ago
- Counter Data structure for Golang using CountMin Sketch with a fixed amount of memory☆44Updated 6 years ago
- Go bindings for the Apache Lucy full text search library. The Apache Lucy search engine library provides full-text search for dynamic pro…☆47Updated 10 years ago
- Word Stemming in Go☆79Updated 6 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆25Updated 6 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers.☆53Updated 7 years ago
- shoco is a compressor for small text strings.☆10Updated 5 years ago
- a pure Go port of ultrajson☆75Updated 4 years ago
- a Library for building auto-complete services with Golang and Redis☆0Updated 5 months ago
- Package for concurrently walking files☆104Updated 8 years ago
- CSRF protection middleware via context for Go.☆21Updated 6 years ago
- Utilities for working with discrete probability distributions and other tools useful for doing NLP work☆96Updated 13 years ago
- A multicore csv reader library in Go☆47Updated 7 months ago
- Library to extract text from HTML files☆11Updated 8 years ago
- Various parsing utilities, such as IP, time, and top-level-domain, in Go☆24Updated 8 years ago
- Go package to convert natural language strings to numbers☆43Updated 4 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Updated last year
- Ngram index for golang☆114Updated 8 years ago