A simple Spark LDA example that demonstrates a full-fledged clustering algorithm, with data cleaning using processes such as lemmatization and stemming.
☆23 · Oct 8, 2016 · Updated 9 years ago
Alternatives and similar repositories for spark-LDA-example
Users who are interested in spark-LDA-example are comparing it to the libraries listed below
- Email Analysis Tool based on Hadoop ☆20 · Apr 26, 2021 · Updated 4 years ago
- Spark, NLP, new word discovery, natural language processing ☆23 · Mar 16, 2018 · Updated 7 years ago
- A tool for translating Scala source code into readable and maintainable Java code ☆13 · Jan 3, 2026 · Updated 2 months ago
- Web crawlers and machine learning ☆48 · Jul 19, 2017 · Updated 8 years ago
- Topic Modeling with LDA in Scala and Spark ☆31 · Sep 25, 2018 · Updated 7 years ago
- A batch-processing system based on Spring Boot and Spring Batch. ☆10 · Sep 10, 2018 · Updated 7 years ago
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin. ☆10 · Mar 28, 2019 · Updated 6 years ago
- Sample code for blog posts ☆15 · Oct 26, 2012 · Updated 13 years ago
- zdh series: a Java-based business risk-control engine ☆13 · Jan 24, 2026 · Updated last month
- ☆11 · Sep 1, 2022 · Updated 3 years ago
- Spark projects from studying the book "Machine Learning with Spark" ☆10 · Jun 3, 2017 · Updated 8 years ago
- Links to all the source code and solutions I reference in my O'Reilly Introduction to Docker video tutorial ☆11 · Dec 10, 2014 · Updated 11 years ago
- Machine learning projects ☆38 · Mar 13, 2017 · Updated 8 years ago
- A library for bulk-loading data from Spark into HBase ☆42 · Nov 18, 2016 · Updated 9 years ago
- ☆14 · Apr 12, 2022 · Updated 3 years ago
- An embeddable double sided accounting ledger built on PG/SQLx ☆10 · Feb 16, 2026 · Updated 2 weeks ago
- ☆12 · Jan 27, 2023 · Updated 3 years ago
- Chinese word segmentation based on RNN ☆13 · Oct 14, 2016 · Updated 9 years ago
- Demos for "Intro to Reactive Programming" talk ☆11 · Sep 19, 2015 · Updated 10 years ago
- Example of running Bevy inside a Web Worker ☆10 · Nov 28, 2024 · Updated last year
- A generic, Maven-based Scala project template ☆18 · Mar 9, 2016 · Updated 9 years ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka ☆11 · Mar 9, 2018 · Updated 7 years ago
- Python and Scala APIs for enhanced Spark analytics ☆12 · Mar 15, 2017 · Updated 8 years ago
- This project uses a Long Short-Term Memory (LSTM) model, a type of recurrent neural network, to predict the future price of Bitcoin. ☆14 · Jul 13, 2023 · Updated 2 years ago
- A web architecture using Spring MVC + Phoenix to operate on HBase ☆10 · Aug 20, 2018 · Updated 7 years ago
- Java implementation for KillrVideo project ☆11 · Jun 4, 2018 · Updated 7 years ago
- The Bounded framework for Scala, Akka and Domain Driven Design ☆11 · Nov 26, 2024 · Updated last year
- Meedan's Open Source Arabic/English Translation Memory ☆33 · Nov 4, 2009 · Updated 16 years ago
- ServiceFramework example project ☆10 · Apr 2, 2016 · Updated 9 years ago
- ☆41 · Jul 19, 2019 · Updated 6 years ago
- cryptosharing ☆10 · Feb 27, 2022 · Updated 4 years ago
- An Elasticsearch tokenization plugin based on the HanLP toolkit ☆10 · Mar 20, 2018 · Updated 7 years ago
- Data repository for Karuta. Can write into MySQL and Oracle databases. ☆11 · Nov 5, 2025 · Updated 4 months ago
- A distributed machine learning algorithm for new word discovery. ☆15 · Jul 21, 2014 · Updated 11 years ago
- Creates a Lucene index out of files from a local folder ☆13 · Aug 8, 2014 · Updated 11 years ago
- A simple example usage of HBase on Trusted Analytics Platform. ☆10 · Jul 6, 2016 · Updated 9 years ago
- Break Away: Programming And Coding Interviews, published by Packt ☆13 · Jan 30, 2023 · Updated 3 years ago
- A collection of customized dotfile configurations. ☆11 · May 14, 2025 · Updated 9 months ago
- Data warehouse ETL tool: incremental and full extraction from MySQL to Hive, merging of Hive tables, and other data-platform cleaning utilities ☆10 · Dec 21, 2016 · Updated 9 years ago