The Lightning Catalog is an open-source data catalog designed for preparing data at any scale in ad-hoc analytics, data virtualization, data warehousing, lake houses, and ML projects.
☆37Feb 5, 2026Updated last month
Alternatives and similar repositories for lightning-catalog
Users that are interested in lightning-catalog are comparing it to the libraries listed below
Sorting:
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆79Apr 27, 2025Updated 10 months ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- 适合2到6岁的宝宝打字游戏☆10May 29, 2020Updated 5 years ago
- Integration of Iceberg table management into Spark SQL☆11Jan 21, 2020Updated 6 years ago
- norm4j (Not an ORM for Java) is a lightweight, SQL-centric alternative to JPA☆14Feb 2, 2026Updated last month
- Write SQL-like queries over JavaScript data structures☆10Jan 30, 2020Updated 6 years ago
- RAG-based Chatbot that helps answer questions around healthy eating & lifestyle choices, based on 1200+ science-backed blog posts of Nutr…☆13Sep 15, 2025Updated 5 months ago
- A sql extension build on spark3 datasource v2 api, ex: hive v2 catalog support amoung multi clusters☆12May 7, 2022Updated 3 years ago
- 基于springboot3、Spring AI、vue3开发的后台管理项目。多注释、少封装、尽量精简,方便二开和代码阅读☆16Jan 15, 2026Updated last month
- Easily stand up Keycloak and SPIRE for testing AI Agents☆29Sep 18, 2025Updated 5 months ago
- The tool to visualise architecture of python packages☆10Aug 16, 2023Updated 2 years ago
- Solidity contracts for Aave leveraged swap☆11Feb 28, 2022Updated 4 years ago
- Pulumi provider for KIND☆11Nov 5, 2021Updated 4 years ago
- ☆12Aug 27, 2024Updated last year
- Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP☆10Jul 18, 2022Updated 3 years ago
- Engineered a production-grade WhatsApp clone backend leveraging Spring Boot's ecosystem☆10Apr 26, 2025Updated 10 months ago
- Apache Arrow Guide☆17Oct 10, 2021Updated 4 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Simple Python Script that uses a LLM to answer questions about your code☆12Jan 2, 2025Updated last year
- Course Scheduling Management LMS - Low level design with standard design patterns using Java.☆11Jul 27, 2022Updated 3 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 5 months ago
- SQLAlchemy driver for SAP Sybase SQL Anywhere☆12Mar 9, 2023Updated 3 years ago
- R and Python solutions to applied exercises in An Introduction to Statistical Learning with Applications in R (corrected 7th printing)☆16Jun 4, 2025Updated 9 months ago
- 对YCSB进行了Web化,使ycsb的压测工作可以在web端完成,并实现了分布式压测的能力,一键启动多个客户端,系统提供压测指标实时看板功能,以及指标汇总报告等能力,欢迎试用。☆14Jan 19, 2022Updated 4 years ago
- antlr4_tex2sym parses LaTeX math expressions and converts it into the equivalent SymPy form by using antlr4.☆11Oct 7, 2020Updated 5 years ago
- JUnit testing without dull routine☆31Mar 30, 2017Updated 8 years ago
- A framework for building behaviour-driven tests in fluent Java.☆35Mar 27, 2020Updated 5 years ago
- Multi-Agent Deep RAG☆39Feb 25, 2026Updated last week
- Buddy: Your AI Coding Buddy☆18Aug 27, 2025Updated 6 months ago
- ☆10Mar 11, 2025Updated 11 months ago
- ☆10Feb 24, 2025Updated last year
- NodeJS interface for HMD positional data.☆21Aug 3, 2015Updated 10 years ago
- 对接 Dify不同应用的 API,从而对接自己的业务系统,实现与 Dify 应用的对话流处理,将对话结果流式返回给前端,并将对话结果分发给开发者自行处理☆12Sep 4, 2024Updated last year
- ☆15Updated this week
- HTTP client library wrapping Apache HttpAsyncClient☆17May 5, 2025Updated 10 months ago
- ☆18Dec 6, 2024Updated last year
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- j.u.s.Stream alternative (synchronous only), reusable, faster, more operators, easier to use.☆18Feb 23, 2026Updated 2 weeks ago