Keyword extraction package for Spark.
☆12 · Jan 15, 2017 · Updated 9 years ago
Alternatives and similar repositories for spark-miner
Users who are interested in spark-miner are comparing it to the repositories listed below.
- Topic Modeling with LDA in Scala and Spark ☆31 · Sep 25, 2018 · Updated 7 years ago
- protein embedding project ☆12 · May 3, 2018 · Updated 7 years ago
- The repository of the hands-on introduction to machine learning workshop of the DataLearn 2019 track at DataHack 2019. ☆10 · Sep 1, 2019 · Updated 6 years ago
- Analysis on stop reasons ☆10 · Jun 17, 2024 · Updated last year
- ☆10 · Jun 29, 2021 · Updated 4 years ago
- !!!!(DEMO)!!!! !!! CHECK OUT THE NEW VERSİON !!! Counting Close People with Yolov7 ☆13 · Sep 14, 2022 · Updated 3 years ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22) ☆13 · Jun 17, 2022 · Updated 3 years ago
- Code to reproduce the paper "Do causal predictors generalize better to new domains?" ☆15 · Feb 7, 2025 · Updated last year
- Repository for Booking.com Data Challenge 6th Place Solution ☆10 · Feb 17, 2021 · Updated 5 years ago
- ☆27 · Sep 10, 2025 · Updated 5 months ago
- Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines" ☆10 · Mar 31, 2022 · Updated 3 years ago
- Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow. ☆12 · Dec 24, 2016 · Updated 9 years ago
- A scala implementation of Support Vector Machines ☆17 · Dec 4, 2013 · Updated 12 years ago
- Spark interface to the TileDB storage manager [please see README] ☆17 · Dec 23, 2024 · Updated last year
- 📖 A review of KGEM packages and frameworks at https://pykeen.github.io/kgem-software-review. ☆12 · Jun 24, 2024 · Updated last year
- ☆14 · Feb 12, 2024 · Updated 2 years ago
- ☆15 · Nov 20, 2024 · Updated last year
- Resources from the Question Generation Shared Task & Evaluation Challenge 2010 ☆12 · Dec 21, 2010 · Updated 15 years ago
- My personal cheat sheets ☆12 · Jan 9, 2026 · Updated last month
- Winning data science solution for Energy Hack NL 2018. Sonnet: forecasting station load caused by solar panels. ☆11 · May 28, 2018 · Updated 7 years ago
- A python tool to examine datasets for consistency. Performs approximately 150 tests. For EDA (Exploratory Data Analysis) and interpretabl… ☆10 · Apr 4, 2024 · Updated last year
- ☆16 · Dec 15, 2025 · Updated 2 months ago
- ☆13 · Mar 29, 2023 · Updated 2 years ago
- deep multi-instance learning for rna protein binding prediction ☆10 · May 21, 2017 · Updated 8 years ago
- User-friendly extensions to MeSH ☆11 · Feb 4, 2016 · Updated 10 years ago
- Cascading and Scalding wrapper for HBase with advanced read features ☆54 · Feb 11, 2020 · Updated 6 years ago
- Fork of RecurrentGPT with modifications ☆10 · Sep 18, 2024 · Updated last year
- ☆13 · Apr 23, 2025 · Updated 10 months ago
- Scala non-blocking Aerospike client (archived as unmaintained) ☆20 · Jan 25, 2019 · Updated 7 years ago
- ☆13 · Dec 5, 2024 · Updated last year
- Some scoring functions for predicting the effects of mutations on protein sequences using ESM-2 ☆11 · Dec 10, 2023 · Updated 2 years ago
- PIA - Starter kit for a personal assistant (chatbot) using Chatito and RasaNLU ☆11 · Mar 23, 2018 · Updated 7 years ago
- pretrained LookingGlass language model for biological read-length DNA sequences, and related models derived from transfer learning ☆15 · Feb 19, 2026 · Updated 2 weeks ago
- ☆65 · May 22, 2014 · Updated 11 years ago
- ☆12 · May 2, 2025 · Updated 10 months ago
- Self Supervised Learning for Time Series Using Similarity Distillation ☆12 · Jun 29, 2022 · Updated 3 years ago
- Code for MICCAI 2017 paper on binary sparse convolutions for semantic segmentation of medical images ☆11 · Jun 15, 2017 · Updated 8 years ago
- JanusDNA ☆25 · Updated this week
- ☆10 · Oct 31, 2023 · Updated 2 years ago