jlubawy / go-boilerpipe
Golang port of the boilerpipe Java library used for the removal of boilerplate and extraction of text content from HTML documents.
☆70Updated 10 months ago
Alternatives and similar repositories for go-boilerpipe:
Users that are interested in go-boilerpipe are comparing it to the libraries listed below
- An implementation of the Goose HTML Content / Article Extractor algorithm in golang☆40Updated 3 years ago
- Multiclass Naive Bayesian Classification☆75Updated 6 years ago
- Named Entity Recognition for golang via MITIE☆33Updated 6 years ago
- A Go package that implements the JusText boilerplate removal algorithm☆108Updated 2 years ago
- simhash storage and searching☆138Updated 7 years ago
- Read and write WARC files in Go☆45Updated 6 years ago
- A small library in golang, that detects the language of a text. (text categorization)☆153Updated last year
- Go Stanford NLP POS Tagger wrapper☆38Updated 8 years ago
- A Go implementation of the readability algorithm by arc90 labs☆132Updated 2 years ago
- CLD2 (Compact Language Detector 2) bindings for Go (golang)☆38Updated 5 years ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy☆73Updated 9 years ago
- Stemmer packages for Go programming language. Includes English, German and Dutch stemmers.☆53Updated 8 years ago
- package lingo provides the data structures and algorithms required for natural language processing☆154Updated 2 years ago
- High Performance Porter2 Stemmer☆45Updated 4 years ago
- News Content / Article Extractor written in Go☆32Updated 9 years ago
- Html Content / Article Extractor in Golang☆442Updated 10 months ago
- Text summarizer for golang using LexRank☆128Updated 11 months ago
- Chrome Automation Library using Google Chrome Remote Debugger API in Go☆85Updated 3 years ago
- Offline language detection☆47Updated 7 years ago
- Pluck text in a fast and intuitive way☆215Updated 5 years ago
- Guess the natural language of a text in Go☆58Updated 7 years ago
- Takes a full name and splits it into individual name parts☆43Updated 5 months ago
- Golang package to extract useful text from a HTML document☆40Updated last year
- Utilities for working with discrete probability distributions and other tools useful for doing NLP work☆96Updated 13 years ago
- Read and use word2vec vectors in Go☆55Updated 6 years ago
- Split (rows and columns), sort, and search☆55Updated 2 years ago
- 🔮 Use TensorFlow models in Go to evaluate Images (and more soon!)☆63Updated 6 years ago
- A Go implementation of the WordNet API☆39Updated 5 years ago
- A Go package for n-gram based text categorization, with support for utf-8 and raw text☆72Updated 3 months ago
- Trigram search library for Go☆69Updated 10 years ago