A JRuby command line application and library for Apache Tika to extract text and metadata from files of various formats.
☆54May 1, 2025Updated 11 months ago
Alternatives and similar repositories for rika
Users that are interested in rika are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A DropWizard wrapper around Apache Tika.☆10Dec 22, 2016Updated 9 years ago
- Refines Ruby Simplecov test coverage data as CLI, MCP server, and library☆30Updated this week
- A program to record HTTP statistics. More info here: http://blog.lanyonm.org/articles/2015/03/29/golang-http-stats-collector.html☆13Mar 4, 2016Updated 10 years ago
- LockJar manages Java Jars for Ruby☆45May 4, 2016Updated 9 years ago
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14May 16, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Basic Ember.js visualizations built in D3.☆92Sep 2, 2013Updated 12 years ago
- Relational algebra optimizer☆22Jan 24, 2014Updated 12 years ago
- Locality Sensitive Hashing in Ruby☆33Oct 18, 2013Updated 12 years ago
- Implements the "Metaphone" phonetic algorithm adapted for Russian language☆19Oct 23, 2013Updated 12 years ago
- ☆34Oct 31, 2025Updated 5 months ago
- A simple Ruby library built to handle easy conversion and manipulation of colors.☆51May 23, 2024Updated last year
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- add `grep` subcommand to bundler☆15Dec 28, 2023Updated 2 years ago
- Erlang driver for libphonenumber.☆14Mar 9, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Mar 13, 2019Updated 7 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- ☆10Dec 31, 2023Updated 2 years ago
- Simplified Ruby is a minimal subset of the Ruby syntax that allows to take any problem description and turn it into idiomatic, easy to un…☆14May 27, 2019Updated 6 years ago
- OSS2017 - Open Science for Synthesis: Gulf Research Program☆10May 12, 2019Updated 6 years ago
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆18Jan 27, 2024Updated 2 years ago
- Hacking GitHub into Cinnamon Linux☆30Nov 4, 2016Updated 9 years ago
- Docker-based development environment for hacking Ruby MRI☆30Aug 6, 2022Updated 3 years ago
- ActiveRecord database anonymization using views☆13Jan 1, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- bundler support for jars for jruby☆211Mar 12, 2026Updated last month
- Elixir driver for the Datomic REST API☆47Feb 6, 2016Updated 10 years ago
- A Ruby library for using MuPDF☆31Updated this week
- Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum☆18Jul 1, 2022Updated 3 years ago
- Pre-render and mount React components from Ruby☆15Jan 11, 2021Updated 5 years ago
- ☆11Feb 4, 2026Updated 2 months ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 9 years ago
- Unicode command-line codepoint dumper☆20Apr 8, 2024Updated 2 years ago
- ☆10Jul 25, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Bluesky text parser that outputs parsed text with rich text facets☆16Feb 25, 2025Updated last year
- Bundler plugin for showing gem diffs☆44Jan 2, 2025Updated last year
- Full-text RSS feed for https://meduza.io☆21Jun 3, 2015Updated 10 years ago
- Measure text similarity using weighted ngrams.☆18Feb 27, 2014Updated 12 years ago
- WindSR Dataset contains more than 22,000 pairs of HR/LR wind speed images, which are processed using the NASA's GEOS-5 Nature Run dataset…☆12Jan 18, 2024Updated 2 years ago
- Padrino can can use all CanCan goodies☆29Oct 24, 2014Updated 11 years ago
- RESTful wrapper for the Joshua machine translation decoder☆14Oct 25, 2016Updated 9 years ago