diasks2 / pragmatic_segmenterLinks
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
☆573Updated last year
Alternatives and similar repositories for pragmatic_segmenter
Users that are interested in pragmatic_segmenter are comparing it to the libraries listed below
Sorting:
- Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy☆110Updated 3 years ago
- Simple Ruby client for Wikidata☆35Updated last year
- English Part-of-Speech Tagger Library; a Ruby port of Lingua::Tagger☆272Updated 6 months ago
- Wikipedia information extraction library☆175Updated last year
- Namae (名前) parses personal names and splits them into their component parts.☆167Updated last year
- A scalable and shareable repository of text annotation☆31Updated 3 weeks ago
- Ruby bindings to the Stanford Core NLP tools (English, French, German).☆435Updated 3 months ago
- Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)☆502Updated 2 years ago
- A pure Ruby interface to the WordNet database☆91Updated 5 years ago
- A multilingual tokenizer to split a string into tokens☆91Updated last year
- Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coeffic…☆681Updated 4 years ago
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 3 months ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 11 years ago
- Ruby wrapper for correcting spelling and grammar mistakes based on the context of complete sentences.☆478Updated 5 years ago
- Natural language processing framework for Ruby.☆1,371Updated 3 months ago
- Approximate String Matching library☆385Updated last month
- Measure text similarity using weighted ngrams.☆18Updated 11 years ago
- A Ruby interface to the WordNet® Lexical Database.☆138Updated 2 years ago
- A Ruby natural language processor.☆163Updated 3 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 11 years ago
- ☆842Updated 2 years ago
- A generic, language-neutral framework for extending Ruby objects with linguistic methods.☆279Updated 9 years ago
- natural language parsing of recipe ingredients, making sense of amounts, units, and ingredients☆198Updated 3 years ago
- Expose libstemmer_c to Ruby☆250Updated 3 years ago
- Text readability analyzer using Flesch-Kincaid and others☆68Updated 5 years ago
- Fast Ruby FFI string edit distance algorithms☆80Updated 12 years ago
- A language detection library for Ruby that uses bloom filters for speed.☆682Updated 2 years ago
- A general classifier module to allow Bayesian and other types of classifications. A fork of cardmagic/classifier.☆555Updated last year
- Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing☆54Updated 2 years ago
- A command-line toolkit to extract text content and category data from Wikipedia dump files☆174Updated 2 years ago