zouzias / spark-lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
☆128Sep 8, 2025Updated 5 months ago
Alternatives and similar repositories for spark-lucenerdd
Users who are interested in spark-lucenerdd are comparing it to the libraries listed below.
- Examples of spark-lucenerdd☆15Oct 6, 2023Updated 2 years ago
- Big Data search with Spark and Lucene☆18Dec 15, 2023Updated 2 years ago
- Analytic UIMA pipelines using Spark☆24Nov 27, 2015Updated 10 years ago
- Creates a Lucene index out of files from a local folder☆13Aug 8, 2014Updated 11 years ago
- A small java library for NLP Interchange Format (NIF) for NER(D) systems☆10Sep 13, 2022Updated 3 years ago
- Hadoop integration code for working with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- ☆11Apr 24, 2018Updated 7 years ago
- Datalog implementation in Scala.☆12Jun 17, 2014Updated 11 years ago
- ML Featurizer is a library to enable users to create additional features from raw data with ease☆14Apr 8, 2024Updated last year
- Some examples to demonstrate using the threejs framework from JSweet.☆11Dec 10, 2019Updated 6 years ago
- Examples for the Activate conference☆11Sep 11, 2019Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Nov 9, 2023Updated 2 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Jun 8, 2016Updated 9 years ago
- Some popular algorithms (DBSCAN, kNN, FM, etc.) on Spark☆32May 29, 2018Updated 7 years ago
- A way to run UIMA pipelines on Apache Spark☆10Jul 19, 2021Updated 4 years ago
- Distributed SQL query engine for running interactive analytic queries against big data sources.☆44Dec 12, 2016Updated 9 years ago
- Docker containers with Apache Accumulo and Apache Spark environment.☆12Jan 22, 2016Updated 10 years ago
- Presto's Elasticsearch connector☆11Dec 7, 2016Updated 9 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Jun 7, 2021Updated 4 years ago
- SQL parsing and execution; can execute Hive, Spark, and Flink SQL, as well as the corresponding algorithm SQL for TensorFlow and Deeplearning4j☆11Sep 16, 2022Updated 3 years ago
- Resources for 3D Deep Learning☆12Sep 7, 2017Updated 8 years ago
- An sbt plugin to configure Java Flight Recorder☆10Jul 28, 2024Updated last year
- The Solr Package Directory and Sanctuary☆13Oct 14, 2025Updated 4 months ago
- Engineering Drawing Parser☆10Jan 24, 2019Updated 7 years ago
- Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.☆445Sep 4, 2025Updated 5 months ago
- Bucketing and partitioning system for Parquet☆30May 22, 2018Updated 7 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Apache UIMA uimaFIT☆32Nov 27, 2024Updated last year
- Source code for the Style Similarity project: measure style similarity between 3D shapes.☆14Mar 10, 2020Updated 5 years ago
- C as an Embedded Language in Scala☆18Dec 17, 2014Updated 11 years ago
- N-dimensional arrays, with Zarr and HDF5 integrations☆19Feb 26, 2019Updated 6 years ago
- Neo4j Scala client using Akka-Http☆15Mar 9, 2017Updated 8 years ago
- TPC-DS benchmarks including data generation with Spark and queries with Spark☆14May 8, 2017Updated 8 years ago
- Spark SQL index for Parquet tables☆134May 6, 2021Updated 4 years ago
- Scala wrapper around the Google Sheets API☆33Jul 4, 2024Updated last year
- The reference implementation of IDML for the JVM☆43Feb 13, 2025Updated last year
- ☆17Feb 16, 2020Updated 6 years ago
- Bringing Spire to Dotty/Scala 3☆14Feb 19, 2024Updated last year
- An sbt plugin to fix java.lang.OutOfMemoryError: Metaspace/PermGen errors during interactive sbt usage☆14Feb 16, 2017Updated 9 years ago