diasks2 / pragmatic_tokenizerView external linksLinks
A multilingual tokenizer to split a string into tokens
☆93Aug 11, 2024Updated last year
Alternatives and similar repositories for pragmatic_tokenizer
Users that are interested in pragmatic_tokenizer are comparing it to the libraries listed below
Sorting:
- Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.☆589Aug 11, 2024Updated last year
- High performance unsupervised text tokenization for Ruby☆21Dec 27, 2023Updated 2 years ago
- Fast transformer inference for Ruby☆596Jan 9, 2026Updated last month
- Composable pipelines for Enumerators.☆205Jan 3, 2026Updated last month
- Efficient text classification and representation learning for Ruby☆221Updated this week
- An evolutionary computation framework for Ruby☆63May 10, 2025Updated 9 months ago
- Simple and customizable text tokenization gem.☆31Sep 28, 2021Updated 4 years ago
- Ruby implementation of Global Vectors for Word Representation☆16Apr 4, 2015Updated 10 years ago
- Curated List: Practical Natural Language Processing done in Ruby☆1,073Jun 27, 2023Updated 2 years ago
- Ruby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.☆38Feb 2, 2026Updated last week
- A collection of links to Ruby Natural Language Processing (NLP) libraries, tools and software☆1,287Mar 5, 2023Updated 2 years ago
- Use decorators on Ruby methods!☆256May 9, 2024Updated last year
- Organize ActiveRecord models into a tree using PostgreSQL's ltree datatype☆141Mar 20, 2021Updated 4 years ago
- Ruby gem (native extension in Rust) providing implementations of various string metrics☆77May 7, 2022Updated 3 years ago
- Comparison of Ruby (Puma, Falcon), Crystal, Go, Node.js, and Python.☆18Nov 6, 2025Updated 3 months ago
- Xf - Transform Functions☆61Sep 14, 2018Updated 7 years ago
- file metadata parsing, done cheap☆67Dec 11, 2024Updated last year
- A gem for memoization in Ruby☆202Sep 4, 2025Updated 5 months ago
- Smart spellchecker for Ruby code and docs☆45Apr 16, 2024Updated last year
- Stemming for Ruby, powered by Snowball☆45Dec 31, 2025Updated last month
- Named-entity recognition for Ruby☆180Dec 28, 2025Updated last month
- Fastest Json parser for Ruby, wrapper for simdjson☆306Aug 29, 2025Updated 5 months ago
- 🔎 Investigating your ruby code dependencies☆110Jan 24, 2023Updated 3 years ago
- A high-level interface to the CMU Link Grammar. (Github mirror)☆77Dec 24, 2020Updated 5 years ago
- undercover warns about methods, classes and blocks that were changed without tests, to help you easily find untested code and reduce the …☆826Feb 6, 2026Updated last week
- A fast and accurate rule-based sentence segmentation tool for Ruby.☆52Jan 29, 2026Updated 2 weeks ago
- Strict interfaces in Ruby☆91Apr 19, 2024Updated last year
- A pure Ruby interface to the WordNet database☆91Aug 22, 2019Updated 6 years ago
- ObjectTracer tracks objects and records their activities☆448May 4, 2024Updated last year
- Bmg - Relational Algebra for Modern Times☆240Oct 4, 2025Updated 4 months ago
- Advisory locking for ActiveRecord☆692Jan 21, 2026Updated 3 weeks ago
- Thruster server definition for Capybara☆32Dec 6, 2024Updated last year
- Word Count Analyzer is a Ruby gem that analyzes a string for potential areas of the text that might cause word count discrepancies depend…☆20Oct 8, 2023Updated 2 years ago
- Data processing & ETL framework for Ruby☆1,776Jan 10, 2026Updated last month
- Framework-agnostic Ruby Gem for imgproxy with support for Ruby on Rails' most popular image attachment options☆203May 20, 2025Updated 8 months ago
- Time series forecasting for Ruby☆432Feb 5, 2026Updated last week
- ImageInfo finds the size and type of a single or multiple images from the web by fetching as little as needed in batches.☆89Jul 23, 2025Updated 6 months ago
- Fast and distributed workflow runner using ActiveJob and Redis☆1,097Nov 20, 2025Updated 2 months ago
- Generate user-friendly, pseudo-random codes without ambiguous letters or numbers (e.g. 0 vs O vs o)☆18Nov 23, 2015Updated 10 years ago