This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite etc
☆34Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for big-data-projects
Users that are interested in big-data-projects are comparing it to the libraries listed below
Sorting:
- Examples of Spark 3.0☆45Nov 11, 2020Updated 5 years ago
- An experiment to inject a customized parser using SparkSessionExtension☆16Jan 1, 2018Updated 8 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- CODO is an ontology for the semantic representation and annotation of COVID-19 data in a machine-readable form for tracking history of th…☆10Apr 19, 2022Updated 3 years ago
- A dbt (data build tool) project you can use for testing purposes or experimentation☆38Nov 23, 2023Updated 2 years ago
- ☆12Updated this week
- Visual tool for SPARQL queries on graphol graphs☆10Oct 3, 2018Updated 7 years ago
- REDCap Electronic Data - I (Ingester/Integrator/Importer)☆10Oct 15, 2018Updated 7 years ago
- ☆10Nov 18, 2021Updated 4 years ago
- Java library for interacting with Consul.☆12Mar 20, 2023Updated 2 years ago
- Delve is a debugger for the Go programming language.☆11Apr 9, 2023Updated 2 years ago
- ☆11Feb 24, 2022Updated 4 years ago
- a simple lakeFS webhook for pre-commit and pre-merge validation of data objects☆12Nov 9, 2023Updated 2 years ago
- Elasticsearch plugin for Sentiment Analysis using Stanford CoreNLP☆11Oct 17, 2018Updated 7 years ago
- Functional programming exercises. Materials for a course in Sofia University.☆11Aug 22, 2015Updated 10 years ago
- Quickly run SchemaSpy on a database and serve the results☆10Mar 24, 2021Updated 4 years ago
- A basic DNN tutorial in PyTorch, for persons without a background in Linux, Python, or remote servers☆10Apr 2, 2020Updated 5 years ago
- A Reactive Sparql Client written in Scala and Akka☆13Sep 18, 2023Updated 2 years ago
- A tutorial for learning how to interact with HubSpot's APIs☆11Jan 5, 2023Updated 3 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 5 years ago
- Material for the XSLT 3.0 Interoperability workshop at XML Prague 2016☆10Jun 5, 2016Updated 9 years ago
- ☆16Feb 28, 2026Updated last week
- A series of articles that explore working with data using Datafusion and Apache Arrow.☆10Mar 17, 2021Updated 4 years ago
- A reasonably complete and well-tested golang port of httpbin, with zero dependencies outside the go stdlib.☆11Nov 24, 2025Updated 3 months ago
- Rust FFI example project for Java & Python☆10Jun 8, 2019Updated 6 years ago
- Zookeeper management project under the control of simple rights(简单权限控制下的zookeeper管理项目)☆12Jun 25, 2018Updated 7 years ago
- Simple Go 1.8 plugin test for https://jeremywho.com/go-1.8---plugins/☆10Feb 28, 2017Updated 9 years ago
- A repository to work on the transmodel ontology that provides support to the NeTEx model☆11Feb 17, 2021Updated 5 years ago
- ☆10Jun 14, 2014Updated 11 years ago
- Produce GTFS-realtime data from a SIRI data source.☆12Apr 11, 2022Updated 3 years ago
- Java client for Hawkular☆11Mar 16, 2017Updated 8 years ago
- ☆18May 27, 2025Updated 9 months ago
- The Toxic Comment Classification project is an application that uses deep learning to identify toxic comments as toxic, severe toxic, obs…☆17Jul 27, 2023Updated 2 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Mar 24, 2023Updated 2 years ago
- CKAN extension for data.world☆12Dec 5, 2023Updated 2 years ago
- Code for a tutorial for basic concepts working with Akka using Scala.☆21Mar 29, 2013Updated 12 years ago
- MOVIO - Online Virtual Exhibitions☆15Nov 23, 2020Updated 5 years ago
- RDF file extension for DuckDB. Reads well, writes in progress☆14Updated this week