Spark Implementation of Google Facets Overview https://github.com/PAIR-code/facets
☆56Oct 16, 2023Updated 2 years ago
Alternatives and similar repositories for facets-overview-spark
Users that are interested in facets-overview-spark are comparing it to the libraries listed below
Sorting:
- This project includes various scripts for Ensage.☆11Jan 5, 2015Updated 11 years ago
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- A collection tools/scripts to explore the ListenBrainz data using Apache Spark.☆16Jan 19, 2020Updated 6 years ago
- A Survey on Learning to Hash☆10Apr 10, 2018Updated 7 years ago
- Gradle Plugin for ScalaPB☆12Jul 1, 2020Updated 5 years ago
- Java and Scala client libraries for Concord☆13Feb 15, 2017Updated 9 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆29Jan 6, 2017Updated 9 years ago
- Implementation of DeepFM using keras. And train on libsvm format file☆13Aug 15, 2019Updated 6 years ago
- Feature engineering toolkit for Spark MLlib.☆12Apr 1, 2017Updated 8 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…☆21Mar 20, 2017Updated 9 years ago
- ☆10Jul 20, 2023Updated 2 years ago
- Writing application logic for Spark jobs that can be unit-tested without a SparkContext☆76Jan 27, 2019Updated 7 years ago
- Contextual Recommendation Implementation for Research Purposes☆19Jul 3, 2024Updated last year
- Simple clock/cron process that monitors a specific directory and run jobs based on its filename.☆10Jun 8, 2020Updated 5 years ago
- Run FeatureTools to automate Feature Engineering distributionally on Spark.☆11Oct 11, 2018Updated 7 years ago
- Python API for Deequ☆41Nov 10, 2020Updated 5 years ago
- A small utility module to make it simple to build BentoML Services into images inside Kubernetes clusters.☆10Dec 15, 2020Updated 5 years ago
- 一个比Spark-Parquet还快5~100倍的存储格式☆12Feb 22, 2016Updated 10 years ago
- Pyspark Notebook With Docker☆11Aug 18, 2015Updated 10 years ago
- Openscoring application for the Docker distributed applications platform☆12Nov 8, 2020Updated 5 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Feb 10, 2016Updated 10 years ago
- ☆10Jan 12, 2021Updated 5 years ago
- 将deepwalk、node2vector和阿里的文章:Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba用代码实现☆15Apr 14, 2020Updated 5 years ago
- XPath extension for extraction from interactive web sites. NOTE: This code is currently out of sync. A more recent, but precompiled versi…☆27Feb 27, 2013Updated 13 years ago
- VoltDB Click Stream Processing Example.☆16Jan 2, 2018Updated 8 years ago
- A balancing plate simulator.☆13Sep 20, 2022Updated 3 years ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines.☆50Mar 6, 2026Updated 2 weeks ago
- Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue.☆12Mar 13, 2025Updated last year
- A platform for online learning that curtails data latency and saves you cost.☆47Jan 6, 2022Updated 4 years ago
- This is the code of reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …☆16Aug 19, 2021Updated 4 years ago
- The Spring Travel reference application, modified to work on Cloud Foundry.☆18Aug 24, 2015Updated 10 years ago
- ☆107Nov 9, 2022Updated 3 years ago
- ☆21Nov 5, 2018Updated 7 years ago
- A parallel implementation of word2vec based on Spark☆22May 19, 2017Updated 8 years ago
- Explains machine learning models fast using the Anchor algorithm originally proposed by marcotcr in 2018☆15Dec 19, 2025Updated 3 months ago
- ☆19Sep 4, 2023Updated 2 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Oct 17, 2017Updated 8 years ago
- A database API for Scala☆12Apr 11, 2020Updated 5 years ago
- ☆16Feb 24, 2017Updated 9 years ago