A multilingual tokenizer to split a string into tokens
☆93Aug 11, 2024Updated last year
Alternatives and similar repositories for pragmatic_tokenizer
Users that are interested in pragmatic_tokenizer are comparing it to the libraries listed below
Sorting:
- Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.☆590Aug 11, 2024Updated last year
- High performance unsupervised text tokenization for Ruby☆20Dec 27, 2023Updated 2 years ago
- Fast transformer inference for Ruby☆599Jan 9, 2026Updated last month
- Composable pipelines for Enumerators.☆205Jan 3, 2026Updated 2 months ago
- Efficient text classification and representation learning for Ruby☆221Feb 19, 2026Updated 2 weeks ago
- An evolutionary computation framework for Ruby☆63May 10, 2025Updated 9 months ago
- Simple and customizable text tokenization gem.☆31Sep 28, 2021Updated 4 years ago
- Ruby implementation of Global Vectors for Word Representation☆16Apr 4, 2015Updated 10 years ago
- Curated List: Practical Natural Language Processing done in Ruby☆1,074Jun 27, 2023Updated 2 years ago
- Ruby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.☆38Feb 2, 2026Updated last month
- A collection of links to Ruby Natural Language Processing (NLP) libraries, tools and software☆1,285Mar 5, 2023Updated 3 years ago
- Use decorators on Ruby methods!☆256May 9, 2024Updated last year
- Organize ActiveRecord models into a tree using PostgreSQL's ltree datatype☆141Mar 20, 2021Updated 4 years ago
- Ruby gem (native extension in Rust) providing implementations of various string metrics☆77May 7, 2022Updated 3 years ago
- Comparison of Ruby (Puma, Falcon), Crystal, Go, Node.js, and Python.☆19Nov 6, 2025Updated 4 months ago
- Xf - Transform Functions☆61Sep 14, 2018Updated 7 years ago
- file metadata parsing, done cheap☆70Dec 11, 2024Updated last year
- A gem for memoization in Ruby☆202Sep 4, 2025Updated 6 months ago
- Smart spellchecker for Ruby code and docs☆45Apr 16, 2024Updated last year
- Stemming for Ruby, powered by Snowball☆46Feb 12, 2026Updated 3 weeks ago
- Named-entity recognition for Ruby☆181Feb 27, 2026Updated last week
- Fastest Json parser for Ruby, wrapper for simdjson☆308Aug 29, 2025Updated 6 months ago
- 🔎 Investigating your ruby code dependencies☆110Jan 24, 2023Updated 3 years ago
- A high-level interface to the CMU Link Grammar. (Github mirror)☆77Dec 24, 2020Updated 5 years ago
- undercover warns about methods, classes and blocks that were changed without tests, to help you easily find untested code and reduce the …☆830Updated this week
- A fast and accurate rule-based sentence segmentation tool for Ruby.☆52Jan 29, 2026Updated last month
- Strict interfaces in Ruby☆91Apr 19, 2024Updated last year
- A pure Ruby interface to the WordNet database☆91Aug 22, 2019Updated 6 years ago
- ObjectTracer tracks objects and records their activities☆448May 4, 2024Updated last year
- Bmg - Relational Algebra for Modern Times☆241Oct 4, 2025Updated 5 months ago
- Advisory locking for ActiveRecord☆693Jan 21, 2026Updated last month
- Thruster server definition for Capybara☆33Dec 6, 2024Updated last year
- Streaming downloads using Net::HTTP, http.rb or HTTPX☆1,059Feb 23, 2026Updated last week
- Word Count Analyzer is a Ruby gem that analyzes a string for potential areas of the text that might cause word count discrepancies depend…☆20Oct 8, 2023Updated 2 years ago
- Data processing & ETL framework for Ruby☆1,775Jan 10, 2026Updated last month
- Framework-agnostic Ruby Gem for imgproxy with support for Ruby on Rails' most popular image attachment options☆204May 20, 2025Updated 9 months ago
- Time series forecasting for Ruby☆433Feb 5, 2026Updated last month
- ImageInfo finds the size and type of a single or multiple images from the web by fetching as little as needed in batches.☆89Jul 23, 2025Updated 7 months ago
- Fast and distributed workflow runner using ActiveJob and Redis☆1,096Nov 20, 2025Updated 3 months ago