diasks2 / pragmatic_segmenterLinks
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
☆576Updated last year
Alternatives and similar repositories for pragmatic_segmenter
Users that are interested in pragmatic_segmenter are comparing it to the libraries listed below
Sorting:
- Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy☆112Updated 4 years ago
- Simple Ruby client for Wikidata☆35Updated last year
- Wikipedia information extraction library☆175Updated last year
- Ruby bindings to the Stanford Core NLP tools (English, French, German).☆435Updated 5 months ago
- A pure Ruby interface to the WordNet database☆91Updated 6 years ago
- A scalable and shareable repository of text annotation☆33Updated last week
- English Part-of-Speech Tagger Library; a Ruby port of Lingua::Tagger☆273Updated 9 months ago
- A generic, language-neutral framework for extending Ruby objects with linguistic methods.☆280Updated 9 years ago
- A Ruby interface to the WordNet® Lexical Database.☆139Updated 2 years ago
- Ruby wrapper for correcting spelling and grammar mistakes based on the context of complete sentences.☆478Updated 6 years ago
- Expose libstemmer_c to Ruby☆250Updated 3 years ago
- Approximate String Matching library☆387Updated last month
- Namae (名前) parses personal names and splits them into their component parts.☆168Updated last year
- Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)☆502Updated 2 years ago
- Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing☆54Updated 3 weeks ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 11 years ago
- Natural language processing framework for Ruby.☆1,371Updated 5 months ago
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 5 months ago
- Wicked fast Conditional Random Fields for Ruby☆37Updated 2 years ago
- A multilingual tokenizer to split a string into tokens☆91Updated last year
- Project for filtering stopwords☆78Updated last year
- ☆852Updated 2 years ago
- Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coeffic…☆684Updated 4 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 11 years ago
- Fuzzy document finding in Ruby☆23Updated 8 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆134Updated 5 years ago
- Text readability analyzer using Flesch-Kincaid and others☆68Updated 5 years ago
- Ruby bindings to the OpenNLP Java toolkit.☆91Updated 5 months ago
- A language detection library for Ruby that uses bloom filters for speed.☆682Updated 3 years ago
- Locality Sensitive Hashing in Ruby☆33Updated 12 years ago