dustin / go-wikiparseLinks
mediawiki dump parser for loading up wikipedia data
☆106Updated last week
Alternatives and similar repositories for go-wikiparse
Users that are interested in go-wikiparse are comparing it to the libraries listed below
Sorting:
- An approximate string matching library for the Go programming language.☆177Updated 2 years ago
- Ngram index for golang☆114Updated 9 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers.☆53Updated 8 years ago
- Word Stemming in Go☆82Updated 7 years ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy☆73Updated 9 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 5 years ago
- High Performance Porter2 Stemmer☆46Updated 4 years ago
- Takes a full name and splits it into individual name parts☆44Updated 10 months ago
- A Go package that implements the JusText boilerplate removal algorithm☆109Updated 2 years ago
- simhash storage and searching☆138Updated 8 years ago
- Genex package for Go☆76Updated 5 years ago
- A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29☆89Updated 2 years ago
- K-Means algorithm implementation in Go☆118Updated 10 years ago
- GNU Aspell spell checking library bindings for Go (golang)☆47Updated 5 years ago
- ☆102Updated 8 years ago
- package lingo provides the data structures and algorithms required for natural language processing☆156Updated 2 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text☆72Updated 8 months ago
- Package mafsa implements Minimal Acyclic Finite State Automata in Go, essentially a high-speed, memory-efficient, Unicode-friendly set of…☆296Updated 6 years ago
- A Go preprocessor for package scoped reflection☆106Updated 7 years ago
- SMTP server library for Go☆213Updated 4 years ago
- Golang pkg for URL parsing and normalization☆160Updated 3 years ago
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library.☆111Updated 3 years ago
- go.tesseract is a wrapper for the tesseract-ocr library.☆67Updated 4 years ago
- Email parsing and mail creation library for golang☆95Updated last year
- A generic patricia trie (also called radix tree) implemented in Go (Golang)☆286Updated last month
- Offline language detection☆47Updated 8 years ago
- The latlong package maps from a latitude and longitude to a timezone.☆390Updated 2 years ago
- JSONGen is a tool for generating native Golang types from JSON objects.☆210Updated 10 years ago
- a persistence-layer Go package for loading/embedding SQL file contents for use in Go programs☆96Updated 6 years ago
- A JSON-LD processor for Go☆114Updated 6 years ago