keithrbennett / rika
A JRuby command line application and library for Apache Tika to extract text and metadata from files of various formats.
☆53 Updated last week
Alternatives and similar repositories for rika:
Users who are interested in rika compare it to the libraries listed below.
- High speed text tokenization for Ruby ☆68 Updated this week
- Edge stream anomaly detection for Ruby ☆54 Updated last month
- Eliminates the drudgery of handcrafting an `autoload` statement for each Ruby source code file in your project ☆50 Updated last year
- annoy-rb provides Ruby bindings for Annoy (Approximate Nearest Neighbors Oh Yeah). ☆35 Updated 4 months ago
- Fast, pure-Ruby Aho-Corasick string search ☆32 Updated 5 months ago
- Puma plugin for starting an ngrok tunnel ☆45 Updated 4 years ago
- A list of languages based upon ISO-639-1 and ISO-639-3 with functions to retrieve only common languages. ☆88 Updated 6 years ago
- Explicit soft deletion for ActiveRecord via deleted_at and default scope ☆70 Updated last month
- Breakout detection for Ruby ☆46 Updated last month
- Ambry is a database and ORM replacement for (mostly) static models and small datasets. It provides ActiveModel compatibility, and flexibl… ☆57 Updated 3 years ago
- A tool for truncating HTML strings efficiently ☆61 Updated 2 months ago
- Class to show progress during script run ☆68 Updated 2 weeks ago
- Filename sanitization for Ruby ☆224 Updated 2 years ago
- biggs is a small ruby gem/rails plugin for formatting postal addresses from over 60 countries. ☆149 Updated last year
- Additional sidekiq middleware ☆92 Updated 8 years ago
- Recurring / Periodic / Scheduled / Cron job extension for Sidekiq ☆88 Updated last year
- Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes) ☆32 Updated 4 years ago
- k-means clustering in Ruby ☆96 Updated 4 years ago
- Allows you to rescue ActiveRecord::RecordNotFound for a specific model ☆60 Updated 9 years ago
- High performance topic modeling for Ruby ☆65 Updated 3 months ago
- Bundler plugin for showing gem diffs ☆44 Updated 4 months ago
- Ruby: sort UTF-8 strings alphabetically via an Enumerable extension ☆68 Updated 5 years ago
- external/replacement version of rake stats ☆117 Updated last month
- A SAX-based XML parser for parsing large files into manageable chunks ☆126 Updated 2 years ago
- Puma integration with systemd for better daemonising under modern Linux systems: notify, status, watchdog ☆38 Updated 3 years ago
- WebDav client library ☆80 Updated last year
- Simple configuration library that works well with ENV vars and config files ☆23 Updated 2 years ago
- Rails console history for Heroku, Docker, and more ☆80 Updated this week
- Modularize your monolith without friction ☆46 Updated 3 months ago
- is_crawler does exactly what you might think it does: determine if the supplied string matches a known crawler or bot. ☆31 Updated 7 years ago