A simple tokenizer in Ruby for NLP tasks.
☆46Apr 3, 2017Updated 8 years ago
Alternatives and similar repositories for tokenizer
Users that are interested in tokenizer are comparing it to the libraries listed below
Sorting:
- Various distance and similarity measures for machine learning.☆30Jun 22, 2020Updated 5 years ago
- ☆15Aug 12, 2013Updated 12 years ago
- Accurate Bayesian sentence tokenizer in Ruby.☆80Apr 30, 2014Updated 11 years ago
- Ruby Microdata parser for RDF.rb☆33Jan 8, 2024Updated 2 years ago
- **Deprecated (Part of the stdlib since Ruby 2.0)**☆36Oct 4, 2016Updated 9 years ago
- Bayesian inference for Ruby, powered by CmdStan☆25Jan 13, 2026Updated last month
- High performance unsupervised text tokenization for Ruby☆20Dec 27, 2023Updated 2 years ago
- High performance t-SNE for Ruby☆21Feb 19, 2026Updated last week
- Community maintained development kit for Prismic and the Ruby language☆46Jul 6, 2023Updated 2 years ago
- Sentiment analysis for the German language☆20Apr 24, 2016Updated 9 years ago
- A super simple way to do AB tests in Rails☆81Apr 18, 2014Updated 11 years ago
- A Chinese Word Segmentation(中文分词) routine in pure Ruby☆110Sep 12, 2017Updated 8 years ago
- Simple kNN Classifier written in Ruby☆60Mar 24, 2021Updated 4 years ago
- Implementation of Moses Charikar's simhashes in Ruby☆45Nov 26, 2014Updated 11 years ago
- Curated List: Practical Natural Language Processing done in Ruby☆1,074Jun 27, 2023Updated 2 years ago
- Comprehensive search solution for ActiveRecord and MySQL.☆61Mar 11, 2017Updated 8 years ago
- Publish Rails application metrics to statsd.☆30Aug 10, 2024Updated last year
- Mudis is a fast, thread-safe, in-memory, sharded LRU cache for Ruby applications. Rails and Hanami (or any Rack) compatible.☆28Feb 8, 2026Updated 2 weeks ago
- Display money in Chinese characters.☆31Jul 28, 2020Updated 5 years ago
- Ability to execute crystal code in a fashion similar to pry edit.☆34Oct 30, 2021Updated 4 years ago
- Ruby library for working with the Apple News API☆29Jun 8, 2021Updated 4 years ago
- Ping-API test script examples.☆10Jan 5, 2016Updated 10 years ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese☆10Dec 9, 2013Updated 12 years ago
- Simple and customizable text tokenization gem.☆31Sep 28, 2021Updated 4 years ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- Speech ANDroid Apps☆20Jan 22, 2014Updated 12 years ago
- 🌈Make your debug life a little bit more colorful☆10Jan 2, 2020Updated 6 years ago
- Ruby interface for Moby wordlists☆14Feb 14, 2013Updated 13 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling☆10Mar 27, 2014Updated 11 years ago
- "Save as DAISY" add-in for Microsoft Word☆10Dec 22, 2025Updated 2 months ago
- vscode-translation 翻译插件☆10Mar 3, 2022Updated 3 years ago
- A port of ruby 2.0 to native php 5.4+☆10Sep 21, 2013Updated 12 years ago
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743☆12Aug 4, 2018Updated 7 years ago
- A specialized Amazon AWS gem for finding Album Art running on top of the 'sucker' gem☆16Sep 10, 2013Updated 12 years ago
- 中文语料:大量人工标注样本,非常有价值 !!!☆11Aug 15, 2019Updated 6 years ago
- Concurrent index migrations for Rails☆46Dec 27, 2025Updated 2 months ago
- 💰 A cryptocurrency price monitoring tool☆15Dec 27, 2023Updated 2 years ago
- 落地页管理系统☆10Oct 24, 2019Updated 6 years ago
- Solarized style for Qt Creator's syntax highlighter☆31Aug 22, 2016Updated 9 years ago