Bulk loading for Elasticsearch
☆187 · Dec 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for wonderdog
Users that are interested in wonderdog are comparing it to the libraries listed below.
- an experimental graph server ☆21 · Jun 25, 2011 · Updated 14 years ago
- Machine learning and natural language processing with Apache Pig ☆53 · Dec 17, 2013 · Updated 12 years ago
- playing around with the common crawl dataset ☆70 · Aug 18, 2012 · Updated 13 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link. ☆84 · Aug 21, 2014 · Updated 11 years ago
- Text clustering service for the web ☆25 · Mar 30, 2019 · Updated 7 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project ☆581 · Jul 8, 2014 · Updated 11 years ago
- A grouping of Apache Pig examples. ☆65 · Oct 13, 2020 · Updated 5 years ago
- Mirror of Apache Whirr ☆96 · Apr 28, 2017 · Updated 9 years ago
- CommonCrawl Hello World example ☆33 · Jun 25, 2014 · Updated 11 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a … ☆51 · Jul 4, 2011 · Updated 14 years ago
- A project for code to create models from existing corpora and distribute models. ☆42 · Apr 11, 2012 · Updated 14 years ago
- The ElasticSearch View Plugin provides a simple way to render ElasticSearch documents in HTML, XML or text ☆49 · Mar 3, 2013 · Updated 13 years ago
- Nerdfight monitoring service ☆10 · Feb 22, 2017 · Updated 9 years ago
- Elastical has moved to https://github.com/ramv/node-elastical and this repo is no longer maintained. Please update your bookmarks! ☆100 · Feb 1, 2013 · Updated 13 years ago
- A JRuby DSL for Cascading ☆41 · Sep 23, 2015 · Updated 10 years ago
- A small library to add some convenience methods to Scala encompassing predicate logic ☆21 · Mar 16, 2016 · Updated 10 years ago
- This is a prototype app that store items into a Hazelcast map and queue based on the description in https://wiki.mozilla.org/Socorro:Clie… ☆17 · Apr 11, 2011 · Updated 15 years ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re… ☆100 · Apr 9, 2013 · Updated 13 years ago
- realtime search/indexing system ☆59 · May 27, 2014 · Updated 12 years ago
- Pattern matching in javascript ☆48 · Mar 8, 2010 · Updated 16 years ago
- MConn is a framework to build custom service-discovery-solutions on top Mesosphere's Marathon ☆10 · Jul 27, 2015 · Updated 10 years ago
- useful JVM classes for the mrjob hadoop streaming framework ☆31 · Jun 20, 2013 · Updated 12 years ago
- It counts ☆62 · Dec 17, 2012 · Updated 13 years ago
- A flexible, partial, out-of-order and real-time typeahead search library ☆567 · Nov 13, 2013 · Updated 12 years ago
- ☆19 · Mar 24, 2022 · Updated 4 years ago
- scikit-learn: machine learning in Python ☆13 · Mar 14, 2025 · Updated last year
- Mirror of Apache HCatalog ☆59 · Apr 14, 2023 · Updated 3 years ago
- Spec files and other things needed to package up elasticsearch ☆77 · Jul 3, 2013 · Updated 12 years ago
- bigram / trigram analysis of wikipedia; mainly mutual info ☆22 · Mar 6, 2012 · Updated 14 years ago
- Make your Hibernate Search more Elastic ! WARNING : project suspended ! ☆16 · Apr 24, 2011 · Updated 15 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf ☆14 · Apr 25, 2012 · Updated 14 years ago
- ☆12 · Aug 22, 2011 · Updated 14 years ago
- do all first links on wikipedia _really_ lead to philosophy? ☆23 · Feb 6, 2013 · Updated 13 years ago
- A Hadoop toolkit for web-scale information retrieval research ☆86 · Dec 12, 2014 · Updated 11 years ago
- Redis bulk-loader for Apache Pig ☆39 · Apr 21, 2012 · Updated 14 years ago
- ☆11 · May 22, 2015 · Updated 11 years ago
- Generate citations for a list of URLs ☆28 · Aug 12, 2013 · Updated 12 years ago
- Git pre-commits hooks for doing Python code formatting checks ☆20 · Feb 7, 2012 · Updated 14 years ago
- UT Austin Machine Learning Group Latent Variable Modeling Toolkit ☆26 · Feb 2, 2012 · Updated 14 years ago