UnderstandLingBV/Tuktu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UnderstandLingBV/Tuktu)

UnderstandLingBV / Tuktu

Big Data Science Swiss Army Knife - http://www.tuktu.io --

☆59

Alternatives and similar repositories for Tuktu

Users that are interested in Tuktu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jankatins / pydatapipes
View on GitHub
Python data pipelines similar to R
☆12Oct 23, 2016Updated 9 years ago
tristanpenman / chordial
View on GitHub
A simple Scala implementation of Chord, a distributed lookup protocol
☆23Oct 1, 2025Updated 9 months ago
spoddutur / graph-knowledge-browser
View on GitHub
Real-time query spark and visualise it as graph.
☆24Oct 4, 2017Updated 8 years ago
fancyspeed / semi-lda
View on GitHub
Semi-supervised Latent Dirichlet Allocation (LDA)
☆12Dec 21, 2017Updated 8 years ago
gangeshwark / all-datasets-links
View on GitHub
Curated list of all dataset websites that I find
☆84Oct 17, 2018Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
apache / incubator-retired-amaterasu
View on GitHub
Apache Amaterasu
☆56Oct 18, 2019Updated 6 years ago
snowplow-archive / icebucket
View on GitHub
UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage
☆14Sep 10, 2015Updated 10 years ago
FRosner / spawncamping-dds
View on GitHub
Data-Driven Spark allows quick data exploration based on Apache Spark.
☆29Jan 6, 2017Updated 9 years ago
comsysto / jumbodb
View on GitHub
☆21Jan 9, 2019Updated 7 years ago
datasetu / vermillion
View on GitHub
A high-performance, scalable middleware for time-series and static-file data exchange.
☆14Jul 20, 2023Updated 3 years ago
rajivgrover009 / Deep-Learning
View on GitHub
☆13Aug 17, 2017Updated 8 years ago
hawkular / cassalog
View on GitHub
A Cassandra schema change management tool for applications running on the JVM
☆14Apr 19, 2018Updated 8 years ago
jimmycallin / pydsm
View on GitHub
A Python framework for exploring distributional semantic models.
☆85Dec 12, 2015Updated 10 years ago
mdup / easycluster
View on GitHub
☆16May 27, 2015Updated 11 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
qubole / quark
View on GitHub
Quark is a data virtualization engine over analytic databases.
☆101Jul 13, 2017Updated 9 years ago
landlord / flatmate
View on GitHub
Reduce memory usage by running multiple applications in the same JVM.
☆13Jul 11, 2019Updated 7 years ago
alexbyrnes / FCC-Political-Ads
View on GitHub
Archive of political ad data from the Federal Communications Commission
☆21Oct 25, 2017Updated 8 years ago
maropu / hivemall-spark
View on GitHub
A Hivemall wrapper for Spark
☆31Apr 21, 2016Updated 10 years ago
MLWave / normalized-compression-neighbors
View on GitHub
Document or binary file vectorization with Normalized Compression Distance in Python.
☆17Oct 14, 2015Updated 10 years ago
duanebester / akka-ws-test
View on GitHub
Realtime feedback of Akka Stream processing via WebSockets
☆16Dec 9, 2019Updated 6 years ago
teradata-aster-field / toaster
View on GitHub
Tools for Aster in R
☆11Mar 29, 2017Updated 9 years ago
lucidworks / simple-category-extraction-component
View on GitHub
Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem
☆11Jan 27, 2025Updated last year
sjyk / sampleclean
View on GitHub
SampleClean+BlinkDB
☆18May 21, 2014Updated 12 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ethanhe42 / named-entity-recognition
View on GitHub
name entity recognition with recurrent neural network(RNN) in tensorflow
☆16Feb 9, 2022Updated 4 years ago
O2-Czech-Republic / proxima-platform
View on GitHub
The Proxima platform.
☆23Jul 13, 2026Updated last week
mitsuhiko / small-ctor
View on GitHub
Minimal, dependency free implementation of the ctor crate
☆17Aug 1, 2024Updated last year
kootenpv / inthenews.io
View on GitHub
Get the latest and greatest in news (on Python)
☆19Feb 27, 2016Updated 10 years ago
willemt / dogebox
View on GitHub
☆12Apr 29, 2014Updated 12 years ago
castagna / hbase-rdf
View on GitHub
☆24Oct 13, 2020Updated 5 years ago
szilard / GBM-multicore
View on GitHub
GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems
☆20May 13, 2018Updated 8 years ago
jexxa-projects / Jexxa
View on GitHub
Jexxa - A Ports and Adapters Framework for Java
☆14Jul 15, 2026Updated last week
ojedatony1616 / exploratory_transformation
View on GitHub
Repository for exploratory data transformation & visualization talk
☆27Oct 9, 2016Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ldoguin / couchbase-java-importer
View on GitHub
This is a pluggable importer for Couchbase
☆13Jan 20, 2016Updated 10 years ago
IBMSparkGPU / CUDA-MLlib
View on GitHub
CUDA kernel and JNI code which is called by Apache Spark's MLlib.
☆19Jun 18, 2016Updated 10 years ago
Placeware / ThisPlace
View on GitHub
Remember a 3x3 m² location anywhere in the world with just four words.
☆22Jul 4, 2017Updated 9 years ago
ScalaConsultants / akka-periscope
View on GitHub
Akka plugin to collect various data about actors
☆17Aug 19, 2024Updated last year
datamindedbe / lighthouse
View on GitHub
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…
☆64Sep 6, 2024Updated last year
Archimagus / ludumdare36-cardsmith
View on GitHub
☆10Jan 10, 2017Updated 9 years ago
softwaremill / akka-http-session-faq
View on GitHub
☆20Mar 13, 2020Updated 6 years ago