Big Data Science Swiss Army Knife - http://www.tuktu.io --
☆60 · Feb 15, 2018 · Updated 8 years ago
Alternatives and similar repositories for Tuktu
Users that are interested in Tuktu are comparing it to the libraries listed below.
- A simple Scala implementation of Chord, a distributed lookup protocol ☆24 · Oct 1, 2025 · Updated 5 months ago
- Real-time query spark and visualise it as graph. ☆24 · Oct 4, 2017 · Updated 8 years ago
- Words -> Phrases; NLP ☆11 · Apr 8, 2016 · Updated 9 years ago
- Semi-supervised Latent Dirichlet Allocation (LDA) ☆12 · Dec 21, 2017 · Updated 8 years ago
- NiFi provenance reporting tasks ☆14 · Sep 21, 2023 · Updated 2 years ago
- Apache Amaterasu ☆56 · Oct 18, 2019 · Updated 6 years ago
- Reactive framework for creating transport & storage-transparent microservices with Vert.x ☆14 · Apr 16, 2025 · Updated 11 months ago
- A high-performance, scalable middleware for time-series and static-file data exchange. ☆14 · Jul 20, 2023 · Updated 2 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark. ☆29 · Jan 6, 2017 · Updated 9 years ago
- A Python framework for exploring distributional semantic models. ☆85 · Dec 12, 2015 · Updated 10 years ago
- HashCats Auto Clicker is a versatile tool that enhances your gaming experience by automating various actions within the HashCats game ☆18 · Updated this week
- System for mining Wikipedia Usage data to read our collective mind ☆20 · Sep 28, 2014 · Updated 11 years ago
- Quark is a data virtualization engine over analytic databases. ☆101 · Jul 13, 2017 · Updated 8 years ago
- A python implementation of the neural network joint language model and an extension of it using global source context. ☆11 · May 17, 2017 · Updated 8 years ago
- A Hivemall wrapper for Spark ☆31 · Apr 21, 2016 · Updated 9 years ago
- Document or binary file vectorization with Normalized Compression Distance in Python. ☆17 · Oct 14, 2015 · Updated 10 years ago
- Write SQL in Scala ☆30 · Nov 25, 2025 · Updated 4 months ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem ☆11 · Jan 27, 2025 · Updated last year
- ☆12 · Feb 28, 2023 · Updated 3 years ago
- Content for akka presentation and study material ☆20 · May 5, 2024 · Updated last year
- Spanish text summarization demo using CoreNLP ☆10 · Sep 13, 2014 · Updated 11 years ago
- The Proxima platform. ☆22 · Jan 23, 2026 · Updated 2 months ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems ☆20 · May 13, 2018 · Updated 7 years ago
- akka http service for serving spark machine learning models ☆15 · Aug 11, 2017 · Updated 8 years ago
- Akka plugin to collect various data about actors ☆17 · Aug 19, 2024 · Updated last year
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆62 · Sep 6, 2024 · Updated last year
- ScalikeJDBC example for Domain Driven Design Repository implementation. ☆24 · Mar 7, 2019 · Updated 7 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib. ☆19 · Jun 18, 2016 · Updated 9 years ago
- ☆10 · May 3, 2015 · Updated 10 years ago
- Map Reduce implemented in Lua ☆23 · May 19, 2017 · Updated 8 years ago
- ☆18 · Nov 5, 2018 · Updated 7 years ago
- nginx-based hack to cache non-DockerHub registries (k8s.gcr.io, quay.io, your own) ☆26 · Jun 29, 2018 · Updated 7 years ago
- A service to manage your Cuckoo filters ☆18 · Mar 11, 2018 · Updated 8 years ago
- Testing tool in Scala for HTTP JSON API ☆234 · Updated this week
- Convert RDF data to relational databases ☆18 · Feb 26, 2018 · Updated 8 years ago
- Tutorial on parallelization tools for distributed computing (multiple computers or cluster nodes) in R, Python, Matlab, and C. ☆21 · Apr 3, 2019 · Updated 6 years ago
- 🏋️♂️ Vercel Runtimes by @f3l1x ☆14 · Mar 6, 2026 · Updated 2 weeks ago
- ☆21 · Mar 13, 2020 · Updated 6 years ago
- ☆32 · May 17, 2015 · Updated 10 years ago