4paradigm/spark

4paradigm / spark

This is OpenMLDB's Spark Distribution, which is particularly optimized for feature extraction. It includes a few novel techniques, such as native implementation of last join and multi-window parallelization. Its APIs are fully compatible with the standard Spark. It is designed to be a component of OpenMLDB (https://github.com/4paradigm/OpenMLDB)…

☆12

Alternatives and similar repositories for spark

Users who are interested in spark are comparing it to the libraries listed below.

4paradigm / zetasql
ZetaSQL - Analyzer Framework for SQL
☆14 · Oct 23, 2024 · Updated last year
decis-bench / febench
A Benchmark for Real-Time Relational Data Feature Extraction (VLDB'23 Best Industry Paper Runnerup)
☆54 · Sep 9, 2023 · Updated 2 years ago
4paradigm / canopy
Canopy is a machine learning compiler stack with the capability of adopting high-end FPGAs. As a part of OpenAIOS project, Canop…
☆12 · May 7, 2021 · Updated 5 years ago
oom-ai / oomstore
Lightweight and Fast Feature Store Powered by Go (and Rust).
☆94 · Feb 28, 2022 · Updated 4 years ago
tobegit3hub / openmldb-chatgpt-plugin
The ChatGPT plugin to enhance OpenMLDB.
☆51 · Apr 6, 2023 · Updated 3 years ago
4paradigm / OpenEmbedding
OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.
☆33 · Apr 13, 2023 · Updated 3 years ago
MichaelYin1994 / netman-2018-kpi-anomaly-detection
An engineering deployment practice, built on OpenMLDB, for the KPI time-series anomaly detection competition of the 2018 International AIOps Challenge
☆11 · Aug 30, 2022 · Updated 3 years ago
4paradigm / pafka
Pafka originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for …
☆67 · Jan 2, 2022 · Updated 4 years ago
4paradigm / FeatInsight
FeatInsight is a feature platform based on OpenMLDB
☆22 · Mar 7, 2025 · Updated last year
4paradigm / pskiplist
An implementation of the persistent skiplist based on Intel Optane Persistent Memory. It works with Intel's pmemkv as a storage engine
☆13 · Apr 9, 2021 · Updated 5 years ago
GridGain-Demos / imc-essentials-in-90-minutes
O'Reilly Course, In-Memory Computing Essentials
☆10 · Oct 16, 2020 · Updated 5 years ago
Styp / java-vbench
Java 18 - Vector API Benchmark
☆13 · Aug 31, 2023 · Updated 2 years ago
openjdk / jdk22u
https://openjdk.org/projects/jdk-updates last released 2024-07-17
☆10 · Jul 16, 2024 · Updated 2 years ago
chuyqa / alluxio-ambari-service
Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster
☆13 · Jan 13, 2017 · Updated 9 years ago
memark-io / pmedis
A Redis module to provide support for storing Redis native data structures on PMem.
☆21 · Jul 18, 2022 · Updated 4 years ago
kjmrknsn / livy-manager
Livy Manager - Web UI for Managing Apache Livy Sessions
☆16 · Dec 7, 2017 · Updated 8 years ago
christyharagan / dremio-gis
Dremio GIS UDFs
☆12 · Sep 25, 2018 · Updated 7 years ago
jsnell / zlib-bench
Benchmark script for comparing different versions of zlib
☆15 · Jul 18, 2017 · Updated 9 years ago
sunshineclt / n-gram
Sina News Crawler and Word Segmentation
☆13 · Dec 20, 2017 · Updated 8 years ago
ModelTC / pyvlova
Yet another Polyhedra Compiler for DeepLearning
☆19 · Apr 14, 2023 · Updated 3 years ago
unmeshjoshi / actorscheduling
A simple thin thread demo to showcase how actors work with forkjoinpool
☆10 · Mar 2, 2018 · Updated 8 years ago
sematext / sematext-agent-integrations
Core & Community developed monitoring integrations for Sematext monitoring agent
☆13 · May 30, 2024 · Updated 2 years ago
liuzhan001st / ABAQUS_to_FLAC3d
A MATLAB program that converts a model from an ABAQUS .inp file to FLAC3d.
☆12 · Sep 27, 2016 · Updated 9 years ago
TRUBA-HPC / mastering_transformers
Training materials and accompanying documentation for "Mastering Transformers: From Building Blocks to Real World Applications" training.
☆13 · Sep 13, 2023 · Updated 2 years ago
Xuhpclab / jxperf
☆11 · Jan 4, 2022 · Updated 4 years ago
Radico / trino-plugins
Simplified custom plugins for Trino
☆16 · Jul 29, 2024 · Updated last year
shuhuai007 / ambari-impala-service
☆10 · Aug 30, 2019 · Updated 6 years ago
eto-ai / spark-video
Processing videos on Apache Spark
☆13 · Feb 14, 2022 · Updated 4 years ago
NetEase / lakehouse-benchmark
A benchmark tool for lakehouses.
☆14 · Mar 12, 2023 · Updated 3 years ago
neelts / figma-split-vectors
Split Vectors Figma Plugin
☆13 · Mar 16, 2024 · Updated 2 years ago
k255 / drill-gis
Spatial queries with Apache Drill
☆20 · Nov 2, 2017 · Updated 8 years ago
tracikkaynakplatform / kos
☆13 · Nov 13, 2024 · Updated last year
amplab / zipg
A Memory-efficient Graph Store for Interactive Queries
☆13 · Sep 1, 2021 · Updated 4 years ago
4paradigm / pmemstore
Key/Value Datastore for Persistent Memory
☆27 · Jun 9, 2021 · Updated 5 years ago
xpleaf / minidubbo
A Full RPC Framework Based on Netty.
☆14 · May 19, 2018 · Updated 8 years ago
gg-daddy / system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
☆15 · Dec 1, 2018 · Updated 7 years ago
yuanyan / know-your-chrome
know your chrome
☆21 · Dec 18, 2022 · Updated 3 years ago
tecton-ai / airflow-tecton
Airflow provider for use with Tecton.
☆11 · Aug 26, 2024 · Updated last year
banburytang / List-of-Chinese-Open-Source-Project-Financing
☆16 · Nov 2, 2022 · Updated 3 years ago