yanzhao-irit / data-lake-metadata-management-systemLinks
☆13Updated 3 years ago
Alternatives and similar repositories for data-lake-metadata-management-system
Users that are interested in data-lake-metadata-management-system are comparing it to the libraries listed below
Sorting:
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- Twitter sentiment analysis using Spark and Stanford CoreNLP and visualization using elasticsearch and kibana☆20Updated 7 years ago
- Parameter Server implementation in Apache Flink.☆14Updated 7 years ago
- Tutorials on session-based recommender systems☆11Updated 8 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Updated 2 years ago
- deep entity resolution lite version☆11Updated 5 years ago
- Kylo integration with PDND (previously DAF).☆19Updated 2 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Updated 7 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Updated 9 years ago
- SQLFlow client library for Python☆29Updated 2 years ago
- 大数据【企业级360°全方位用户画像】标签开发部分源码☆19Updated 4 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆16Updated 6 years ago
- Spark in Kaggle competitions☆10Updated 9 years ago
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 9 years ago
- Sample code for Splice Community☆10Updated 2 years ago
- Implementation of NetClus:Ranking-Based Clustering of Heterogeneous Information Networks with Star Network Schema☆10Updated 9 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- This project is a unified ETL platform that support various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Sh…☆17Updated 9 years ago
- ☆17Updated last year
- Distributed implementation of Robust PLSA using Spark☆12Updated 4 years ago
- Classifying economics articles using Latent Dirichlet Allocation☆8Updated 8 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Updated last year
- Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.☆21Updated 2 years ago
- insight data engineering fellow project☆15Updated 8 years ago
- Model management example using Polyaxon, Argo and Seldon☆23Updated 6 years ago
- TF-Tile: an efficient sparse representation for real-valued data☆14Updated 2 years ago
- A SQL parser and analyzer for sql flavors including MySQL, PostgreSQL, BigQuery Standard SQL, Presto SQL and Hive SQL.☆10Updated 2 years ago
- Code for "Boosted Generative Models", AAAI 2018.☆20Updated 7 years ago
- DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation☆13Updated 7 years ago
- ☆37Updated 6 years ago